Gene GM21_0279 details


Gene Information

Locus tag: GM21_0279
Symbol:
ID: 8135586
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 338775
End bp: 341078
Gene Length: 2304 bp
Protein Length: 767 aa
Translation table: 11
GC content: 61%
IMG OID: 644867899
Product: putative phytochrome sensor protein
Protein accession: YP_003020121
Protein GI: 253698932
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID:
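The start, end, and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; both are assumptions, though they are consistent with the values in this record. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from this record (Start bp, End bp).
length_bp = gene_length(338775, 341078)
print(length_bp)                  # 2304
print(protein_length(length_bp))  # 767
```

Both results agree with the Gene Length (2304 bp) and Protein Length (767 aa) fields listed in the record.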


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 91
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAGATG CAGATCTGCT CAAAGAGCAA CTCGAACACC GCAAACGGTT CATGGAGAAG 
ATCAATGAGA TACATTCCTC CGGCAACCTG AACACCATAC TGATACAGCT TAAGGACAGC
ATCGCCGCGC TGTTCAAGGC GGAGCGGCTG ACCATTTACG TGATCGACCA GAAACAGAAG
ATGCTCATGT CGAAGGTGAA ATCGGGGGAG GAGATCCAGC AGATCACCGT CCCCATCGGC
AGCAACAGCC TCGCCGGTTT CTGCGCGCTC TCCGGAACGC TGTTGAACAT CAGGGACGCC
TACGATACCC ACGAGCTGAA GATGATCAGC TCTGGGTTGA GCTTCGACGT CACCTGGGAC
AAGAAAAGCG GCTTCCGCAC CAAGCAGGTG CTCTGCGTCC CGATGAAGTT CAACAACCAG
ATGATCGGCG TGATGCAGCT CATCAACAAG AAGGTGGGCG GCGCCTTCGA CGATACGGAC
CTAAGCTACG CCACAGAGCT CGCCACCTCC CTTTCCATCG CCATCCACAA CATCTTCCGT
CTCGCCGCCT CGGCGAAGCT GATCCGGAAC AAGTCGCGCT ACAACTACCT TTTGGACAAG
AACCTTCTGG AGCAGGCCGA CATCCAGAAG GCGACGGCGC ATCCGGACGT CGCCACCATA
GGCCTAGACC ACGTGTTGAT GCGGGACTGC AACATCCCGC GCGAGGAGAT GATCAAGTGC
CTGAGCCTCA ACTTCGGCAC CGAGTTCGTC TCCTTCGATC CGGACCTGCC GCGCCTGGAC
GACGTGCTGA GAAAGACCAA GCCCGACCGC CTCAAGAAGG AGCGCTGGGT CCCGGTCAAG
ATCGATCACA GCGTGCTGCA GGTGGCCATG GAGGACCCGA CCGACCTCGC CAAGCAGGAC
CTGATCAAGT TCGTCTTCCA GGAGTACAAG AGGATCAGCT TCGTGGGGGC CTTCAGGGAG
GACATAGGGA AGTTCATCGA CCACTACTAC CATCTTGCCG CGGAGAGCGC CGAGGGGCCG
ACCGCCAGCA TCTCCGACCT GTTGAGCAAG CTCGACAACG ACGATGACCC GGCGGTCGAG
CAGGAGGTGC AGAAGGTCTC CGAGCAGGAC AGCGTCATCG TCCAGTTGGT CAACAAGATC
ATCTGCGAAG CGCACGAGAA AAACGCCTCC GACATACATA TAGAGCCGGC GCCGGGGCGC
GAGGACGTGA CGGTGCGCCT GAGGATCGAC GGCAGGTGCG CGGTGTACCA GCGCATCCCC
TACAAGTACA AGCACGCCCT GACCTCCAGG ATCAAGATCA TGGCGGGGCT CGACATCGCG
GAGCGCAGAA AGCCGCAGGA CGGCAAGATC GACTTCAAGA AGTTCGGCCC CAAGGACATC
GAGCTCAGGG TGGCGACCCT CCCGAGCGCC GGGCAACTGG AGGACGTGGT GCTCAGGGTC
CTCGCCTCGG GCGAACCGAT CCCGTTCGAC AACCTGGGAC TCACCGAGCG CAACCGGCGG
GTGTTCACCG ACTGCATCAA GAAGCCGTAC GGGCTGATCC TGGTGGTGGG CCCGACCGGC
AGCGGCAAGA CCACGACGCT GCACTCCGCC GTCTCGGTGA TCAACACCCC GCAGACCAAG
ATCTGGACCG CTGAGGACCC GATCGAGATC ACTCAGAAGG GGCTCCGGCA GGTGCAGGTG
AACGCCAAGA TCGGCTTCAC CTTCGCGACC GCCCTGCGCG CCTTTTTGCG CGCCGACCCG
GACGTCATCA TGGTGGGGGA GATGCGCGAC GAGGAGACCG CCTCGATAGG GGTGGAATCC
TCTCTTACCG GCCACCTGGT CTTCTCGACG CTGCATACCA ACTCGGCGCC GGAGACGGTG
ACGAGGCTCC TCGACATGGG GCTCGACCCG TTCAGCTTCT CCGACGGGCT TCTTTGCATA
CTGGCGCAGC GGCTGGCCCG CAGGCTCTGC GGCGCCTGCA AGGAAAGCTA CCGCCCGGCG
GAAGAGGAGT TGAAAGAGAT CGCGGCCGAG TACGGCGAGG CGGAGTTCGC GGCGCTCAAG
ATGGATCCCT CCGCCATAAC GCTTTCCAAG GCCAAAGGAT GCGCCAAGTG CAACGGCTCC
GGGTACAAGG GGAGGCTCGG CCTCCACGAG ATCCTCGATT GCGGGGACCA GATGAAGGCG
CTCATCAAGA TGAAGGCCGA GGTGGCGGAT TTGAGGAAGC AGGCGATCGC CGACGGCATG
ACCACGCTGA AGCAGGACGG CATACTGAAA TGCTTCCAGG GGCTCACCGA TATTCATGAA
GTGCGGCGAG TCTGCATCAA GTAA
 
Protein sequence
MPDADLLKEQ LEHRKRFMEK INEIHSSGNL NTILIQLKDS IAALFKAERL TIYVIDQKQK 
MLMSKVKSGE EIQQITVPIG SNSLAGFCAL SGTLLNIRDA YDTHELKMIS SGLSFDVTWD
KKSGFRTKQV LCVPMKFNNQ MIGVMQLINK KVGGAFDDTD LSYATELATS LSIAIHNIFR
LAASAKLIRN KSRYNYLLDK NLLEQADIQK ATAHPDVATI GLDHVLMRDC NIPREEMIKC
LSLNFGTEFV SFDPDLPRLD DVLRKTKPDR LKKERWVPVK IDHSVLQVAM EDPTDLAKQD
LIKFVFQEYK RISFVGAFRE DIGKFIDHYY HLAAESAEGP TASISDLLSK LDNDDDPAVE
QEVQKVSEQD SVIVQLVNKI ICEAHEKNAS DIHIEPAPGR EDVTVRLRID GRCAVYQRIP
YKYKHALTSR IKIMAGLDIA ERRKPQDGKI DFKKFGPKDI ELRVATLPSA GQLEDVVLRV
LASGEPIPFD NLGLTERNRR VFTDCIKKPY GLILVVGPTG SGKTTTLHSA VSVINTPQTK
IWTAEDPIEI TQKGLRQVQV NAKIGFTFAT ALRAFLRADP DVIMVGEMRD EETASIGVES
SLTGHLVFST LHTNSAPETV TRLLDMGLDP FSFSDGLLCI LAQRLARRLC GACKESYRPA
EEELKEIAAE YGEAEFAALK MDPSAITLSK AKGCAKCNGS GYKGRLGLHE ILDCGDQMKA
LIKMKAEVAD LRKQAIADGM TTLKQDGILK CFQGLTDIHE VRRVCIK
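The gene and protein sequences above are printed in space-separated blocks of ten. As a rough sketch of how the two listings relate, the code below strips the block spacing and translates DNA codon by codon. It is a plain-Python illustration, not the pipeline that produced this record. It uses the standard amino acid assignments, which translation table 11 shares; table 11's alternative start codons are not handled, which does not matter here because this gene starts with ATG. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over BASES (TTT, TTC, TTA, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence as listed in this record.
first_row = "ATGCCAGATG CAGATCTGCT CAAAGAGCAA CTCGAACACC GCAAACGGTT CATGGAGAAG"
last_row = "GTGCGGCGAG TCTGCATCAA GTAA"
print(translate(first_row))  # MPDADLLKEQLEHRKRFMEK
print(translate(last_row))   # VRRVCIK
```

The two translated fragments match the first twenty and last seven residues of the listed protein sequence. The last row can be translated on its own only because the preceding rows total 2280 bases, a multiple of three, so it begins on a codon boundary.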