Gene GM21_3916 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3916 
Symbol 
ID8139290 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4501114 
End bp4503084 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content68% 
IMG OID644871533 
Productflagellar hook-length control protein 
Protein accessionYP_003023691 
Protein GI253702502 
COG category[N] Cell motility 
COG ID[COG3144] Flagellar hook-length control protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value0.108844 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGATAG CAAACGCAAC ATTTGTAACA GAAGCCGTAC CGGTCGCCGC AACGGCGCCT 
GGATCGGGAA CCGCCGCGAC GGCGGGGGAA GGGGGTGGCC TCTTCCAGCA ACTGCTGCAG
GGGATGCAAC CGGCAGGCTC CGGGGAAACG GCAGCCGCGG CCACGGCAGT TCCCGCGGCG
CAGACGGAAG CCGGCGCTAC TAAAGCGGAG GTCGCGGAGC AGATGGCCGC GGCAAACCTG
GTCCTGCCCG AAGCGGCGCA GGAAATGGCT CCGGTGAACC CGGCTCCGTC AAGGTCCGCC
ACCGGCGGCG CCAGGCTGCA GCTGGTGATG ACCTGGCAGG CGCTGAAGAG CGACCTGGCG
GCAAAGCCGG AACTGGGGGG GATCCCCGCC GGTGCGGAAA CGGCAACGGC CGCGGCTGAA
GTGGCGGCGG CAGGGGAGAT GCTGCCGGAA GTGAAAACCA CGGACGGCGA AGGGGATGAC
GAGGCGCCGG TGCAGGCGGC TTCGCCCAAC ACGGCCGGGG TGGAACTGGC GGCGGCAGTA
GTGGCGCAAT CCGCCCCCAA GGCGGCAGCA GTGGAAGAGG GCGCGGTACC GCGCGGGCTG
GAAGTAGCCG CGGGGAAGGT GGAGGCCGCC CGGGAGCGCA GAGGTACCGA GGCCCCGGTT
GCAGCGGGCG CCAACGCGCT CTCAAGGTTG GAAGAACTGC AGCAAAGGGC AATGGAGCAA
CTAGCCGCCA AGGCGCAGAC GGTGCAGCGA CCAGCAGAAA ACGCGGTTGC ACCGGGAGCG
GACCAACAGG AACGGCTCGC AGCAATCACT GCCGGATTGG AGCTCGACCA GAAGGTGGAG
CAGGCAGAAC TCCCGCAGGA AAAAGGGAGA GCCTTGACAG GCGCCACAGA AGGGGCGCAG
CAGGCGAAAG CGCCTCAGCC GGCACTTGAG GCCGATGGCG CTGCGCGGCC TGGCCTTTCG
GCGCAACCCG CTCCGGAAGC AGGGGCGAAA ACCGCGGCCG CAGGGTTCGT GGCAACGCCT
TTGAAGGGTG CCGCACCGGA AGTCTTGGCC CCGGTCGGCG AACAGGATGC AACGCAGGAG
CAGCAGGCTT CGAACGGCAG CAATCCCGGG ACTGCCGAGG TGACGGCCCA GGCCGCCCCC
AAGGCGAAGG TTCAGGGCGA GGTACTCCCC CCGCGGCAGG GGGCGAACCC CGAGGCGCCC
CGCCCCGAGG CGGCCCAGGG CAACGAAAGG ACGACGCAGA GAAGGGAGTC GCAGCACGGG
GGCGAAAAAA CCGTCCCGCT TGAGGGAGCT GAGAGCGCCG GCCAACCGGC GGCGGCCGGC
GCGGCCGCAC AGGATCTCTC CGGCGCCAAG GGAGCCCCGG TCGTATCCGC CGCAATAACC
CCCGGAGAAC TGCGCGGGGC AGAGGGGTCC CAGTTCAAGG AGCAGGGACA CCGTCAGCAA
GGGCAGGAGG GGCAAAACGC TCAGCTTCAA GGCGCCGCGG TAGGGGCGCA GGGGAGCACG
GCGGAAACGG CGGCGCCCGA ATCCCACCAG AGCGCCACCC GCAGCGCGCT GCATGAGCAT
ATCCTTTCCC AGGTGAAGGA AGGGGTGGTG ACCCATGACG GCAAAGGAAA CGGCCAGATG
AGCATCAGGC TCAACCCGGG GGAACTGGGC GAGCTGAAGA TCCAGGTGCG CATGGAGGAC
AACCGGCTCA AGGTCGAGGT CCAGGCGGAC AACCGCATGG TCAAGGACCT GCTGATGAGC
AACCTGGACT CCCTGAAGGA GGCCCTTTCC GGCAAGAATC TCGCCATGTA CGGGTTCAAC
GTCTCCACCG GCAGCGGCGG TTTTCAGCAG CCGCTTTACG AGGAGAGGGG GAACCAGCGG
CAGCAATCCG CCTCCAGGTT CGCCAGGGGG GGAGGGTACG ACGCTCCGCA GGAGACCCGA
GTCAATTATC TGACGGCGGA GGTCAACAAC CTGCTGGACG TGAGATTCTG A
 
Protein sequence
MMIANATFVT EAVPVAATAP GSGTAATAGE GGGLFQQLLQ GMQPAGSGET AAAATAVPAA 
QTEAGATKAE VAEQMAAANL VLPEAAQEMA PVNPAPSRSA TGGARLQLVM TWQALKSDLA
AKPELGGIPA GAETATAAAE VAAAGEMLPE VKTTDGEGDD EAPVQAASPN TAGVELAAAV
VAQSAPKAAA VEEGAVPRGL EVAAGKVEAA RERRGTEAPV AAGANALSRL EELQQRAMEQ
LAAKAQTVQR PAENAVAPGA DQQERLAAIT AGLELDQKVE QAELPQEKGR ALTGATEGAQ
QAKAPQPALE ADGAARPGLS AQPAPEAGAK TAAAGFVATP LKGAAPEVLA PVGEQDATQE
QQASNGSNPG TAEVTAQAAP KAKVQGEVLP PRQGANPEAP RPEAAQGNER TTQRRESQHG
GEKTVPLEGA ESAGQPAAAG AAAQDLSGAK GAPVVSAAIT PGELRGAEGS QFKEQGHRQQ
GQEGQNAQLQ GAAVGAQGST AETAAPESHQ SATRSALHEH ILSQVKEGVV THDGKGNGQM
SIRLNPGELG ELKIQVRMED NRLKVEVQAD NRMVKDLLMS NLDSLKEALS GKNLAMYGFN
VSTGSGGFQQ PLYEERGNQR QQSASRFARG GGYDAPQETR VNYLTAEVNN LLDVRF