Gene GM21_3850 details

Gene Information

Locus tag: GM21_3850
Symbol:
ID: 8139224
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4432996
End bp: 4434441
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 65%
IMG OID: 644871467
Product: GTP-binding signal recognition particle SRP54 G- domain protein
Protein accession: YP_003023625
Protein GI: 253702436
COG category: [N] Cell motility
COG ID: [COG1419] Flagellar GTP-binding protein
TIGRFAM ID: [TIGR03499] flagellar biosynthetic protein FlhF
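
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length includes a terminal stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length(4432996, 4434441)
print(length)                  # 1446
print(protein_length(length))  # 481
```

Under these assumptions the values agree with the listed Gene Length (1446 bp) and Protein Length (481 aa).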


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.0000866601
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCTAGTTA AAACCTTTCA GGCAGGCGAG ATGTCGGAGG CTTTACGGAT GGTGAAGGCC 
GAGATGGGGC TGGACGCCAT GATCCTCTCC TCGAAGAAGG AGCGGAAGAA GGGGTTCCTT
GGCTTCTTCT CCAAGCCGTA CTACGAGGTG ACCGCGGCGC TGGAGCCGAG ACAGCCCCAG
CCGCGTCACA ACCCCTACCG CGAGGAGGCT CCTCCGGCGC CCCCCGAGCG CGAACTCTCC
ACCCGCGAGG AGTTCCAGAA CTCCATGCTG GGACCGCTGG CGCGGGAGGT GCGCGAGCTG
AAGCAGCGCC TCGAAGCGCT TGCGAAAAAA GAGGCTGCGG CGCCGCAGCA GATGCAGCCG
GCACCCGAGC CGGTTGTCGC CGAGCCATCC TCCCCCCCCA GGACCTTCGC CAAGGAGGAG
CTGGAGGAGA TCAAGAAGCT CCTGTACAGC GCGGTCTCGG GGAAGGAGAA AGAGCCCAGG
CTGGCAACCT TCCCTCTCGC CGGGGGGGGG GGCGCCGAAA CCACCGCCGC TAAAAGCATG
GCGGCCCAGC TGCAGCAGCA GCTCGAGGTG CTGAGCGTGC CGCCGCTGCC GCCGGTAAAA
GAGGCCACGC TGGTGCAGGA GGTCCGGGAG GCCAAGCAGA GCGCCCGCGC GAGCGAAGGG
GAACTCCTGC TCGACGCGCT TGCCGCCGAG CTGCAGGGGG AGGACGTAGG TCCCGCGACG
ATCGAGCTCC TCATGGAGGC GATCAGGCCC GCGGCCCGCG GCGGCGCCGG CATCGGCGAG
CTGAGAAACT TGATGTCGGA GGCGCTCGCC GGGATGATCA AGTGCTCAGG CTCCTTGCGC
ATCAAAAAGA CCGGTCCGCG CATCGTCGCG GTAGTCGGCC CCACCGGAGT GGGTAAGACC
ACGACCATCG CCAAGATCGC CGCCCTTTAT GCCTTGAACC GCCGCGTCTC GGTCGCCATG
GTGACCATGG ACAACTTCAG GGTGGGCGCG GTGGAGCAGC TTAAGACCTA CGCGAAGATC
ATGGACCTGC CGCTGGAGGT GGCCGGCAAC TCCCAGGAAC TCGGCAAGGC GCTCGCCAGG
CACTCGGACA AGGACCTGAT CATGATCGAC ACCGCCGGGA GAAGCCCCAA GGATTCCGAA
CGGTTGGACG AGCTGAAGGG CTACCTGGAG GCGCACAACG GCATCGACGT CTACCTCTGC
CTCTCCGCCA CCACCAGGAC CCGCGAGATC GACGAGATCA TAGCGACCTT CGGGACGCTG
CCGATCACGA AGCTTCTCTT CACCAAGCTG GACGAGAGCA GGAGCCTTGG CTGCATCGTC
GACACCTATC TGAAGCACAA GGTTCCCCTT TCCTATTTCA GCACCGGCCA GAAGGTGCCC
GAGGACATCG AGGTGGCTAA CTCTCGCAAA CTCGCCTCTC TGGTGGTGCA GGAGTCAACA
AGATGA
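
The gene sequence above is printed in space-separated blocks of 10 bases. A minimal sketch of how such text could be joined into one string and its GC fraction computed, for comparison with the GC content listed in the Gene Information section, follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and how the database rounds its percentage is not stated in this record.

```python
def clean_sequence(text: str) -> str:
    """Join a blocked, multi-line sequence into one uppercase string."""
    # str.split() with no arguments splits on any whitespace, including newlines.
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Usage with the first block row of the gene sequence:
first_row = "ATGCTAGTTA AAACCTTTCA GGCAGGCGAG ATGTCGGAGG CTTTACGGAT GGTGAAGGCC"
seq = clean_sequence(first_row)
print(len(seq))
print(round(100 * gc_fraction(seq)))
```

Pasting the full gene sequence into `clean_sequence` in the same way gives the value to compare with the listed percentage.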
 
Protein sequence
MLVKTFQAGE MSEALRMVKA EMGLDAMILS SKKERKKGFL GFFSKPYYEV TAALEPRQPQ 
PRHNPYREEA PPAPPERELS TREEFQNSML GPLAREVREL KQRLEALAKK EAAAPQQMQP
APEPVVAEPS SPPRTFAKEE LEEIKKLLYS AVSGKEKEPR LATFPLAGGG GAETTAAKSM
AAQLQQQLEV LSVPPLPPVK EATLVQEVRE AKQSARASEG ELLLDALAAE LQGEDVGPAT
IELLMEAIRP AARGGAGIGE LRNLMSEALA GMIKCSGSLR IKKTGPRIVA VVGPTGVGKT
TTIAKIAALY ALNRRVSVAM VTMDNFRVGA VEQLKTYAKI MDLPLEVAGN SQELGKALAR
HSDKDLIMID TAGRSPKDSE RLDELKGYLE AHNGIDVYLC LSATTRTREI DEIIATFGTL
PITKLLFTKL DESRSLGCIV DTYLKHKVPL SYFSTGQKVP EDIEVANSRK LASLVVQEST
R
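
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes the standard codon assignments, which translation table 11 shares (table 11 differs from the standard table in its permitted start codons, not in codon meanings). The `translate` function is an illustrative helper, not a tool provided by the database.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
print(translate("ATGCTAGTTA AAACCTTTCA GGCAGGCGAG"))  # MLVKTFQAGE
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` gives a string to compare against the complete 481-residue protein.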