Gene GM21_2100 details


Gene Information

Locus tag: GM21_2100
Symbol:
ID: 8137436
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2442238
End bp: 2444493
Gene Length: 2256 bp
Protein Length: 751 aa
Translation table: 11
GC content: 60%
IMG OID: 644869715
Product: GAF sensor signal transduction histidine kinase
Protein accession: YP_003021910
Protein GI: 253700721
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID:
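
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2442238
END_BP = 2444493


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2256
print(protein_length(length_bp))  # 751
```

Both results agree with the Gene Length (2256 bp) and Protein Length (751 aa) fields.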


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value: 0.0262959
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACCTT CACGACTACG CGACATGCCG CTTTACAACA GCAGGATCAT CAACTCCTAC 
ATCCTGCTGA TAAAGCACAA GTACAGCCAC GTGGACATCT CCGAGCTGCT CCAGTACGCC
GGGATGAAGG AATACGAGGT TGCCGACCAG GCGCACTGGT TCAGCCAGGA GCAGATCGAC
AGGTTCCACG AGAAGCTGCA GCAGATGACC GGCAACGCGA GGATCTCGCG GGAGGCCGGG
CGCTACGCCG CATCCCCGGA CGTTTTGGGG GCGATGAGGC AGTACATCCT GGGGCTCGTG
GATGCCTCCA GCACCTTCGA CATCATCAGC AAGACCACCG CCAAGTTCAC CAGGTCCTCC
AGCTACCAGT CGCGCTCCAT CGCCTCCAAC CAGGTCGAGA TAACCGTCAC CCCCTACGAG
CCGGGGTTGG AGAAGCCGTA CCAGTGCGAA AACAGGATCG GCTTTTTCGA GGCGATCGTG
CTTGTCTTCA ACCACAAGAT GACCGACCTG CAGTACTTCC CCGAGATAAG AAACCTGCCG
ACCATCGAGC ATCCCGAATG TATGTTCAGG GGGGATCCGG TCTGCCGCTA CGTCGTCACC
TGGGAGAAAA CCCTTTTCAC CTTCCTGAAA AAGATGAGGA ACATCCTGGC ACTCCCCCTG
GCGGCGCTGA ACCTGGCGCT TTTGGCGGTG GGGGAGGCGG GGCTTTTGAC CTGGGTCTTC
CCGTCGAGTC TCTGCGTACT GCTACTCGTC GCCCTGGCGA CGGAGACCAG CGAGAAAAAG
GCGATCAAGG AGAGCCTTTG GAGCACCCGC GACTCCATCG AGAACCTCCT GGACCAGATC
AACCTCAACT ACAACAACGC CATGCTCACC CACGAGATCG GGCAAAGGCT CGGCAACTAC
ACCAGGATCG AGGAGATGCT CTTCGACGTG GCCCAGATCA TGAGGTACCG GCTGGAATAC
GACCGCGGCA TGATCCTTTT GGCCGACGGC GGCAGAAAAC GCCTGGAGCT GCGCGCAAGC
TACGGCTACA AGGACGAGGA ACTGGCCTGC CTCAACTCGC TCCCGTTCCT TCTGGAAAAC
CGGGAGCAGG GGGACGTCTA CCTGGAATGC TTCCGGGGGC AGCGGCCGTT TCTGGTGAAC
GACCTCTCCA TCGGCAGCGT GGGCGAAAAT ACACTCGCCT GCGCCCACAT GAGCGGCACA
CGCGCCTTCA TCTGCTGCCC CATCGTTGCC GACGGCGCTT CCCTCGGGGT CCTGACCGTG
GAAAACGTGC AGGTGAAAAG GCCGCTGCTG GAAAGGGACA TCAGCCTCAT CATGGGAACC
GCCTCGGTGC TCGGCATCAG CATCCGCAAC TGCGAGCTGA TCGGGGCGCT GGAAACGGCC
AACGAGGAGT TGGAACTGAG GGTGGCCAAG AGAACGGAAG ACCTGGAGAA AAGCCGCCAG
AAGATGCAGC TGCAGCACGA GGAACTGGTG CGGACCTATT TCGAGCTGGA GGAGGAGACC
GCCCAGCGGT TGAGCGCGCT GGAGGAGCTG GCGCGCAAGG AACGGATGCT TTTGCAGCAA
AACCGGCTGG CGGCTCTCGG GGAGATGATC AGCAACATCG CGCACCAGTG GCGCCAGCCC
CTGAACGAAC TGGGGCTGAT CGTCCAGGAA CTTCCAGTCA TGTACGACCG GGGGGACTTC
AACAAGGAGT ACCTGCGCGA AAGCGTCTCC AAATTCATGA AGGTTTTGAG CCACACCTCG
AAGACCATCG ACGACTTCAG GACCTTCTTC AAACCGGACC GGGAGATGGT TCCTTTCCGG
GTCACCGAGG TGGTGGAAAA AGCGCTCTCC TTGGTCGGGG AGAGCTTGAA GCACCTGGAG
ATACAGGTGA CGGTCCATTC TGTGGACGAC CCGGCGATCA TGGGGCACCC AAACGAGTTC
TCCCAGGCGA TACTCAACAT CCTCTTCAAC GCCCGAGACG CCTTCAAAGA GCGCGGCATC
TCTTCCCGGC AGATCGAGAT CCGCATCTTC CCGGAAGACG AAACCTGCGT AGTCACCATA
GCCGACAACG CAGGCGGGAT CCCCGAGGAG ATCATGGACA AGATATTCGA CCCCTATTTC
ACCACCCGCG GACCTGAACA GGGGACCGGG ATCGGCCTCT ACATGACCAA GATGATCGTC
GAGAAAAACA TCCCCGGGAA ACTGTCGGTG CGCAACACGG AAAAGGGGGC GGAATTCAGG
ATCGAGGCGA GCAAAGTTGC GCTCCGCCCG CATTGA
 
Protein sequence
MQPSRLRDMP LYNSRIINSY ILLIKHKYSH VDISELLQYA GMKEYEVADQ AHWFSQEQID 
RFHEKLQQMT GNARISREAG RYAASPDVLG AMRQYILGLV DASSTFDIIS KTTAKFTRSS
SYQSRSIASN QVEITVTPYE PGLEKPYQCE NRIGFFEAIV LVFNHKMTDL QYFPEIRNLP
TIEHPECMFR GDPVCRYVVT WEKTLFTFLK KMRNILALPL AALNLALLAV GEAGLLTWVF
PSSLCVLLLV ALATETSEKK AIKESLWSTR DSIENLLDQI NLNYNNAMLT HEIGQRLGNY
TRIEEMLFDV AQIMRYRLEY DRGMILLADG GRKRLELRAS YGYKDEELAC LNSLPFLLEN
REQGDVYLEC FRGQRPFLVN DLSIGSVGEN TLACAHMSGT RAFICCPIVA DGASLGVLTV
ENVQVKRPLL ERDISLIMGT ASVLGISIRN CELIGALETA NEELELRVAK RTEDLEKSRQ
KMQLQHEELV RTYFELEEET AQRLSALEEL ARKERMLLQQ NRLAALGEMI SNIAHQWRQP
LNELGLIVQE LPVMYDRGDF NKEYLRESVS KFMKVLSHTS KTIDDFRTFF KPDREMVPFR
VTEVVEKALS LVGESLKHLE IQVTVHSVDD PAIMGHPNEF SQAILNILFN ARDAFKERGI
SSRQIEIRIF PEDETCVVTI ADNAGGIPEE IMDKIFDPYF TTRGPEQGTG IGLYMTKMIV
EKNIPGKLSV RNTEKGAEFR IEASKVALRP H
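
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up, with a GC-fraction helper and a basic check that a sequence looks like a complete CDS. The helper names are hypothetical, and the stop codons are those of translation table 11 (TAA, TAG, TGA).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First three blocks of the gene sequence above, used only as a small sample.
sample = clean_sequence("ATGCAACCTT CACGACTACG CGACATGCCG")
print(len(sample))  # 30
print(round(gc_fraction(sample), 2))
```

Applied to the full gene sequence, `clean_sequence` should return a string whose length matches the Gene Length field, and `gc_fraction` can be compared against the reported GC content.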