Gene GM21_2068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2068 
Symbol 
ID8137404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2396617 
End bp2398875 
Gene Length2259 bp 
Protein Length752 aa 
Translation table11 
GC content61% 
IMG OID644869683 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_003021878 
Protein GI253700689 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.0000000000000882027 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGAGCC GCCTCGAAGT AAAGATCATC GTCTCCCTCG CCCTCATCCT GACCTTGCTC 
ATCGCCGTTT ACGGCTGGTG GATCGGCAGC CGGCAGACCT CTGTCTACAT TCAGACCCTC
TCCGACAACC TGCGCATCCT CTCCCGCAGC AACGCCGACC ACGCGGCAAA TTTCATGGTG
ATAAAGGAAT ACGCCGGGTT GGAAGCGCAC ATGCTGGACA GCGCCAACCT CCCCGAGGTG
GTCAACATCC AGGTCGCCGA ACAGGATGGG AACCTTCTGT GCAACATCGA TCGTCCCAAC
CGCGCCGAGC TGCCCAAGGT GAACTACAAC GTCAAATGGG TCAGGGTGCC GCAGGGGTCC
CAACCCGTTC TTAAACGCGA GGGACACCAA CTGGTGAGCT GGTGTCCCAT CGTCGCCGGC
GAACAGCTTG GCTGGGTAAA AATCGTCCTG AGCCTGGATA CGGCGCAGCG CCTCCTGCAG
GCCACCTGGC GCAGCACGCT TGTCATCGGC TTGCTCTGGA TCATCGTGGG AACCGTCCTG
ATGGCCATGG TGGTCAAGCC GCCGCTTCGG GCGGTCCGGG AACTGAGCCG CTTCGCGGAC
GAGCTGCAAA ACCGGAAGGG AGCGCAGGTC TCGGTGCCGC GCGGTGTGTA CGAAATCGAC
ATGCTGGCGG ATGCCCTGAA TCATTCGTCT AGAGAACTCC TGCTGGCCGA GCAGCGCCTG
TTAGCGGAAC AGGAACGCCT CTCCGTAACG CTGCAATCCA TCGGCGACGG CGTCATCGCC
ACCGATACCG AGAGCAGAAT TGTGCTGGTC AACCACGTTG CCGAGCTTAT GACAGGCTGG
ACCGAGAAAC AGGCCACAGG CGTAGGCCTG GACCAGGTTC TTTGCATCGA GCCAAGCGAC
TCCCTCCCGG ACGTCCGGGA AGCGCTTCAG GCGGTGATGG AGCGAAGGCA GACCATAGAA
CTCCCCGACC TGCACCGGGT GTGGTCCCGG GACGGCGTCT CCCGCACGGT AACCGTGATC
GGCGCTCCCA TCATAGACAG CGCGGCCCGG CTGGCAGGTA TGGTGCTAGT GATCCGCGAC
CTGACGGAGA AGGCGAAGAT GGAGGCCGAG AAGACGGGGC TTGCCGAGCA ACTGCTCCAG
TCGCAGAAGA TGGAGGCGGT AGGCAAGCTG GCCGGAGGGG TGGCGCATGA TTTCAACAAC
ATGCTGGGAG TCATCATAGG CAATGCCGAG CTGGCCATGA TGGGTGTCGA GCCTTCGGGG
AAACTCCACG ACCGGCTGCA GGGGATCCTC GATGCCGCCA ACCGCTCCGC CGAGATCACC
CGCCAGTTGC TGGCTTTCTC AAGGCAGCAG CACGCAGAGC CCAAGGTACT CGATCTGAAC
GTAGTTATCG GCAAGATGCT GAAGATGCTG CACAGGCTGA TTGGCGAGGA TATCGAAGTC
GTCTGGTCGC CGGGACAAGA CATCTGGAAA GTGAAACTGG ATCCGAGCCA GTTGGACCAG
ATCATGGCCA ACCTCTGCGT CAACGCCAGG GACGCCATCG CAGGAATCGG GAGAATGGAC
ATCCGGACGG AGAACGTCGA ATTGTCCCCT GAGAAACGAG GCCCTGCGGA GATGCCCCAG
GGAAGGTGCG TGATGCTGGA GGTGAGCGAC AGCGGCTGCG GCATGAGGCG CGAGGTGATG
GAAAGGATCT TCGAACCCTT CTATACGACA AAGGAAGTCG GGCGCGGAAC CGGGCTGGGA
CTGGCGACCG TTTTCGGCAT CGTCAAGCAA AACGACGGGC ACATCGAGGT GCGGAGCGAG
CCCGGGGCCG GTACCAGCTT CAGGCTCTAT TTCCCTGCGG TTGAGGGGGA AGCTCAGGAC
CACAAGAAAG GGAGCGTCGC GGCGATCAGG GGGAACGAGA CCATACTCGT CGTGGAGGAC
GAGCCGTCGA TCAATGCGCT CGCCACCACC ATGCTGTCGG AGTTGGGGTA CAGGGTTTTT
TCAGCGGGGA CACCTGGCGA GGCAGTTAAG GTGGCGGACG GCGGCCAGGT GAAGATAGAC
CTGTTGCTGA CGGATATAAT CATGCCTGAT ATGAACGGGC GCGACTTGTC CGAGTTGCTG
CACCGGTCGC ATCCCGACAT GAAGTGCCTG TTCATGTCGG GGTATACCTC GGACATCATA
TCGGAGCGTG GCAACATAGG GCGGGAGGTC TGTTTTCTGC AAAAGCCCTT CACCACCCAG
ACGTTGGCGG CGAAGGTCAG AGAGGCGCTG CAGGCCTAG
 
Protein sequence
MTSRLEVKII VSLALILTLL IAVYGWWIGS RQTSVYIQTL SDNLRILSRS NADHAANFMV 
IKEYAGLEAH MLDSANLPEV VNIQVAEQDG NLLCNIDRPN RAELPKVNYN VKWVRVPQGS
QPVLKREGHQ LVSWCPIVAG EQLGWVKIVL SLDTAQRLLQ ATWRSTLVIG LLWIIVGTVL
MAMVVKPPLR AVRELSRFAD ELQNRKGAQV SVPRGVYEID MLADALNHSS RELLLAEQRL
LAEQERLSVT LQSIGDGVIA TDTESRIVLV NHVAELMTGW TEKQATGVGL DQVLCIEPSD
SLPDVREALQ AVMERRQTIE LPDLHRVWSR DGVSRTVTVI GAPIIDSAAR LAGMVLVIRD
LTEKAKMEAE KTGLAEQLLQ SQKMEAVGKL AGGVAHDFNN MLGVIIGNAE LAMMGVEPSG
KLHDRLQGIL DAANRSAEIT RQLLAFSRQQ HAEPKVLDLN VVIGKMLKML HRLIGEDIEV
VWSPGQDIWK VKLDPSQLDQ IMANLCVNAR DAIAGIGRMD IRTENVELSP EKRGPAEMPQ
GRCVMLEVSD SGCGMRREVM ERIFEPFYTT KEVGRGTGLG LATVFGIVKQ NDGHIEVRSE
PGAGTSFRLY FPAVEGEAQD HKKGSVAAIR GNETILVVED EPSINALATT MLSELGYRVF
SAGTPGEAVK VADGGQVKID LLLTDIIMPD MNGRDLSELL HRSHPDMKCL FMSGYTSDII
SERGNIGREV CFLQKPFTTQ TLAAKVREAL QA