Gene GM21_3096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3096 
Symbol 
ID8138446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3585769 
End bp3587748 
Gene Length1980 bp 
Protein Length659 aa 
Translation table11 
GC content63% 
IMG OID644870700 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_003022882 
Protein GI253701693 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones113 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAATG TCATCGAGAC CGGCGCTGTA CCGGTACCGC ACGATCTTTT CGCACCCGCG 
CTGGAGGGAA ACCTCGCCGA GAACCAGCAG TACCTCGCCA CTATCCTGGC CACCGCACAG
GTAGGGATAC TGGTGATAGA CAGCGAGTCC CACGTCATCG TGGAGGCGAA CCCCAAGGCC
GTGGAGGTGA TCGGGGTGCC ACGGGAGGAG ATCATCGGCT CGGTCTGCCA CCGCTTCATC
TGCGTGGCGG AGCTGGGGCA GTGCCCGGTG ACCGACCTCG GGCAGTGCAT CCATTGCGGG
GAGCGGGAGC TGATCACGGC CAAAGGGGAG CAGGTGACGG TCGTGAAGAC CGTGGCGAGT
ATCACGCTTG GGGGCAGAGC GTATCTTGTC GAGACCTTCC TTGACATCTC CGACCGCAAG
AAAGCGGAGC AGGCGCTGCA ACGGAGCGAG GAGCGCTGCC GCGACATTCT GGACAACGCC
AACGACCTGA TCCAGAGCGT CGACGCAAAC GGCGCCTTCA TCTACGTGAA CCGGGCCTGG
AAGCAGACCA TGGGGTACAG CGACGAGGAG GTCTCCCGCC TCACCATCTT CGACGTCATC
GCCCCCTCCA GCAAGGAGCA CTGCTCCCTT TTGTTCCGCA GGATCATGAA TGGAGAAAAA
GTCCCGGTGG TCGAGACCGA GTTCATCACC AAGGACGGGT CGGTGGTGGT GCTCGAGGGG
AGCATCAACT GCAAACACCT GGGGGGGCAG CTTCTGGGAA CGCGCGGCAT CTTCCGTGAC
ATCACCGAGC GCAAGAGGAT GCAGGCGGAA CTGATGCAAA GCGAGGACCG CTACCGCAAG
CTCTTCGAGA ACGCGCCGGT GGCTATCGTG GTGCAGTGCG AGGGGGTCTA CGTCTGCGCC
AACAACGAGG CGTGCCGCAT GCTGGGGCGC GACCTGGTCG GCGTCGATGT CCTCTCCACC
GTCCATCCCG ACTACCGCGA CACCGTAATG GAGCGGATCC TGCGGGTGAG CGAGACAGGC
GAGCCGTCGC CCCTTCTGGA GCAGAAGATG CTCCGCTTGG ACGGCAGCAG CATCGATGTG
GAGGTGACCG GCAGCAGCAT CGTCTTCAAG GGGAAAAAAG CGACCCAGGC GGTGATCCGG
GACATCACCG AAAGAAGGCT TGCCGAAGAG CAGCGCCGCG AGTGGAACCT CAGGCTGGAA
AAGGAGGTGG AGGCGAAGAC CAGGCACCTC AAGGAGGCGC AGGCGAAGCT GATCCAGTCG
GAGAAGATGG CGACCCTGGG CGAGGTGATC TCCGGAGCCT CGCATGAACT AAATAACCCG
CTCGCCGGGA TCCTCGGGGC GATCCAGATG CTCAGAAAGA GCGCGCTGGC CCAGCCGATC
GAGCCGGAAC TCCTGGAGGG GATCGACGTC CTGGAAAGCA TCGAGAGCGC CGCTATACGC
TGCCAGAACA TAGTCGCCGA CCTGATCCGC TTCTCGACCC AGGCCCACTG CAACTTCAGC
GAGATCGACA TCAACCAGGT GCTCAGGGAC ACCCTGGAGA TCATGGCCGC CCCCTTCGCC
GATCTGGGGA TCCAGGTGGA GCTTGACTCC GATCCGGCGG TGCCGCTGAT AGAGGGGGAT
TTCGTCAAGC TCCTCGAGGT GTACGTGAGC CTTTTGCGCA ACGCCCAGAA CGCGCTTCCC
GACGGGGGGA CGATATACCT CGGCACCAAG GTGGTGAAGA ATTACGGCGA GCCGCCGCAG
GTGGCGGTCA CCATCCGCGA CACCGGGTGC GGCATCCCTC CCCAAAACCT CTCCAAGATC
TTCGATCCCT TCTTCACCAC GAAGCCGGTC GGGCGCGGGC CCGGGCTCGG GCTCACGGTG
AGCTACGGCA TAGTGAAACG CCACGGCGGG GATATCGACG TGCGCAGCAC GGTGGGGAAG
GGGACCGAAG TGACCGTGAC CGTGCCGCTG CGGCAGCCGA AACCGGGAAG CCTCTCCTGA
 
Protein sequence
MSNVIETGAV PVPHDLFAPA LEGNLAENQQ YLATILATAQ VGILVIDSES HVIVEANPKA 
VEVIGVPREE IIGSVCHRFI CVAELGQCPV TDLGQCIHCG ERELITAKGE QVTVVKTVAS
ITLGGRAYLV ETFLDISDRK KAEQALQRSE ERCRDILDNA NDLIQSVDAN GAFIYVNRAW
KQTMGYSDEE VSRLTIFDVI APSSKEHCSL LFRRIMNGEK VPVVETEFIT KDGSVVVLEG
SINCKHLGGQ LLGTRGIFRD ITERKRMQAE LMQSEDRYRK LFENAPVAIV VQCEGVYVCA
NNEACRMLGR DLVGVDVLST VHPDYRDTVM ERILRVSETG EPSPLLEQKM LRLDGSSIDV
EVTGSSIVFK GKKATQAVIR DITERRLAEE QRREWNLRLE KEVEAKTRHL KEAQAKLIQS
EKMATLGEVI SGASHELNNP LAGILGAIQM LRKSALAQPI EPELLEGIDV LESIESAAIR
CQNIVADLIR FSTQAHCNFS EIDINQVLRD TLEIMAAPFA DLGIQVELDS DPAVPLIEGD
FVKLLEVYVS LLRNAQNALP DGGTIYLGTK VVKNYGEPPQ VAVTIRDTGC GIPPQNLSKI
FDPFFTTKPV GRGPGLGLTV SYGIVKRHGG DIDVRSTVGK GTEVTVTVPL RQPKPGSLS