Gene GM21_3804 details


Gene Information

Locus tag: GM21_3804
Symbol: prfA
ID: 8139178
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4375650
End bp: 4376729
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 63%
IMG OID: 644871423
Product: peptide chain release factor 1
Protein accession: YP_003023581
Protein GI: 253702392
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0216] Protein chain release factor A
TIGRFAM ID: [TIGR00019] peptide chain release factor 1
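
The length fields in this record agree with its coordinates. Assuming the Start bp and End bp values are inclusive, 1-based positions (an assumption about this record's convention, not something the page states), End bp - Start bp + 1 gives the gene length, and dividing by three and subtracting one stop codon gives the protein length. A minimal Python sketch of that check follows; the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length(4375650, 4376729)
print(length, protein_length(length))  # prints: 1080 359
```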


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 114
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCATGT TTGAAAAAAT AGCCGAGCTG GAAAGACGTT TCGAGGAGCT GGAATCGCTG 
CTGTCGGATC CCGAGGTGCT GGCGAACCAG ACCGAGTTCA GGAAGCTCTC CAAGGAGCAT
TCCGGCCTGG CCGAGCTGGT TGCCGCCTAT CGCGAATACA AGAAGATACT CTCCGACATC
GACGACAACA AGGAACTCCT GAAGGAGCCG GACCAGGAGA TGCGCGAGAT GGCCCAGGCC
GAGCTTTTGT CCCTGGAGGC GCGGCGCGAG GAGCTGGAAG GCGAGATCAA GTTGCTGCTC
CTGCCCAAGG ACCCCAACGA CGACAAGAAC GTGGTGCTCG AGATCCGCGC CGGAACCGGC
GGAGACGAGT CCGCGCTTTT CGCGGGGGAC CTGTTCCGCA TGTACTCCCG CTTCGCCGAG
ACCAACCGCT GGCGCGTCGA GATCATCTCA GCCTCCGAGT CGGAGAAGGG GGGCTTCAAG
GAGGTCATCG CGCTAGTCGA GGGGACCGGG GTCTTCGCGA AGCTCAAGTA CGAGTCGGGG
ACCCACCGCG TGCAGCGCGT TCCCGAGACC GAGGCGCAGG GTCGGATCCA CACCAGCGCC
TGCACCGTCG CGGTCATGCC CGAGGCCGAA GACGTCGACA TCGACATCAA CCCCGCCGAC
CTGAAGATCG ACGTGTACCG TTCCTCCGGT GCCGGGGGGC AGCACGTCAA CACCACCGAC
TCCGCGGTCA GGATCACCCA TCTCCCCACC GGGACCGTGG TTGCCTGCCA GGAAGAGCGG
AGCCAGATCA AGAACCGCGC GAAGGCCATG AAGGTGTTGA AGTCCAGGAT CCTGGACAAC
ATCCTCATGG AGCAGAACGC GAAGCTCGCC GCCGACCGCA AGAGCCAGGT CGGAAGCGGG
GATCGCAGCG AGCGCATCAG GACCTACAAC TTCCCGCAGG GGAGGATGAC CGATCACCGG
ATCGGCCTGA CCCTGTACCG TTTGGACGCC ATCATGGCGG GCGACATAGC CGAGATCGCC
GACTCCCTGC GTGCCCATTA CCAGATGGAA GCGCTGCAGG CCCAGAGCGA GGGGATGTAG
 
Protein sequence
MSMFEKIAEL ERRFEELESL LSDPEVLANQ TEFRKLSKEH SGLAELVAAY REYKKILSDI 
DDNKELLKEP DQEMREMAQA ELLSLEARRE ELEGEIKLLL LPKDPNDDKN VVLEIRAGTG
GDESALFAGD LFRMYSRFAE TNRWRVEIIS ASESEKGGFK EVIALVEGTG VFAKLKYESG
THRVQRVPET EAQGRIHTSA CTVAVMPEAE DVDIDINPAD LKIDVYRSSG AGGQHVNTTD
SAVRITHLPT GTVVACQEER SQIKNRAKAM KVLKSRILDN ILMEQNAKLA ADRKSQVGSG
DRSERIRTYN FPQGRMTDHR IGLTLYRLDA IMAGDIAEIA DSLRAHYQME ALQAQSEGM
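
The protein sequence is the translation of the gene sequence under translation table 11. A minimal, dependency-free Python sketch of that translation follows, together with a GC-fraction helper. The names `translate`, `gc_fraction`, and `clean` are illustrative and not part of any database tool. Table 11 assigns the same amino acids as the standard code and differs only in which codons may act as start codons, so the sketch uses the standard amino acid assignments and would render an alternative start codon (such as GTG) as its normal residue rather than M. This gene starts with ATG, so that caveat does not apply here.

```python
BASES = "TCAG"
# Amino acids for codons in TTT, TTC, TTA, TTG, TCT, ... order (NCBI layout).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base groups of the gene sequence above.
first_block = "ATGTCCATGT TTGAAAAAAT AGCCGAGCTG"
print(translate(first_block))  # prints: MSMFEKIAEL
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared with the GC content field.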