Gene Gmet_3478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGmet_3478 
Symbol 
ID3739191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter metallireducens GS-15 
KingdomBacteria 
Replicon accessionNC_007517 
Strand
Start bp3904534 
End bp3907599 
Gene Length3066 bp 
Protein Length1021 aa 
Translation table11 
GC content58% 
IMG OID637780766 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_386411 
Protein GI78224664 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATCA AGACCAGGAT GAGCGTTATC GCCTGTTGCC TTGTGGCGGG CTTTCTCGTG 
CTGACGGCTG TTCCTGCCTA CATCTACATT TCCAGCGAGC TAAAGAGGCA GATCTCCAGC
GAGCAGTTCG CATTGGCCAG GTATATGGCT GCCCATTTCG ACGACAATGT CGACATGGCG
AGGGAGAAGA TCGCAAAGTG CGCGGCAGCC CTCAATCCCG CTGAAGTGAA CGATCCGGCC
GGTCTTCGGG AATTTCTGGA GTTCGAGCAG GAACTCCAGA CATTTTTCGA CCAAGGGATG
TATGTGATCC GTCTTGACGG CACCCTGGTC GCGGCCACCT CCCGGGATGC TGCCCTGAGG
AAAATATCTC CCGACGAAAT TGATCTCCTC GTGAGGGGAA AAGCCGTGTC GAATGCCTGT
GCTTCGCCCC ACGGGCCGAT CCCCTGCCTG ACAGTGGCCG CTCCCATCCG TGACCGTTCC
GGAAGCCTGG TCGCGGTGCT GGCGGGACGG CTCGATGTCA CCAAAAAGCA TTTTCTCGGC
GAGCTTACCG CTTTGAAGGT GGGGAGCGAC GGATACGCGT ACGTTATCGA TTCCCGCCGT
ACTCTCATCA TGCATACGGA CCGGATCCGG ATCCTGGAAA CACTGCACGT CGGCGCCAAC
GAGGGAATCG AAAAGGCCCT CCATGGCTTC GAGGGAACCG TGGAGAACAC CAACGCAAGC
GGCGTGCGCG GCCTCACATC GTTCAAGCGA CTTTCCTCGG CCGGCTGGAT TCTCGCGGTT
CATCTTCCCC TCCAGGAGGC GTATGACTCC CTCTACCGTG TCCGGTTCTA CTTCGGGATC
ATGTTGGCGG CCGCCATGTT TTGTTCCGCG ATGGTTGTCT GGTTCGTAAT GGGAAGGGTT
GTGGCCCCCC TCCTTTCCCT GACCCGGCAC GTGAGGGACA TGGCCGAAAA GAACGGAGCG
GACAGGCTGG TACGCGTCGA TACCCCCGAC GAGATTGGCG ATCTGGCCCG GGCGTTCAAT
GGCATGGTCC GGAAGATCGA GCAGCAGCGG GGGGAACTGG AGCGGGCCAA GGATTTTCAC
CTCTTACTCT TTGAAGAGTT TCCCACCCTC ATTTGGCGGA TGGGCCTCGA CGGACGGTGC
AACTACCTGA ACCGGACATG GTTCGAGTTT ACCGGTAGGA GATCCGCCGA AGCTCTCGGA
GACGGCTGGC TCGAAGATAT CCATCCCGAC GACCGCCCGG AGGTTGAGCA CTGCTTTGCG
AAGGCATTCG CCGCCCGTAG CGCCTATGTC ATGGAATACA GACTTCTCCA CCGCAACGGA
GAATTCCGCT GGATCGAGGA CAATGGCAGA CCTTTCAATG ATCTAGACGG CAACTTTGCC
GGATACATCG GTTTCTCCCT CGACATAACC GAACGGATGA GAGCGGTAGA GGCGCTAAGG
GAGTCGGAGC GCTTTGCCCG TTCCACGGTC GATGCCCTGT CGAGCCATAT CGCCATTCTG
GACGCCACGG GAACCATTGT TGCAGTCAAC AAAGCGTGGC GGGAGTTCGT CCTGGCCAAT
CCACCTCTGT CAAACAATGT GAACGAAGGA GCCAATTATC TGGCTGTCTG CGATGCGGCG
GTCGGTGAGG ATGCCGAGAC GGCGCAATCC TTTGCCCAAG GGATACGGAA CGTGCTGGAG
GGGAAAGCTG CCGTTTTTTC CCGGGAATAT TCCTGCCATT CCCCCGAAGA GAAATGTTGG
TTTGTCGGGC GCGTCACCCG CTTTTCGGGA GATGGTCCAC CCCTCGTGGT TGTTGCCCAT
GAAAACGTGA CGGTCCGGAA GCTGGCGGAG GAGGAGCTGG CCAGAAGTCG GGAGGAGCTT
GTCGCGAAGC ACAAGGAGCT CAAAAGTGCC TTTCTCCAGG TGGCTCAAGG AAAGAAAGAG
TGGGAGCTCA CCCTTGATTG TATCGGTGAC TTGGTCATCA TGGCGGATGC CGCCGGTCGG
GTCAGGCGGT GTAACCGCGC CGTGACCGAG CTTGCCCGCA AGCCCGTGGA CGAGCTGATT
GGCCGACCGT GGCCCGAGGT TGTTATTCTT CCTGAGGTGG AAACCGATAG TTACGCGAAC
GGGGGAGGTG AAATACACCA CAGGGAAACA AATCGGTGGT TCTTGCTCAC CTCTTATCCG
TTCACCGGCA CTGGAACGGA GAAGGGCGCT GTCATTACGC TCCATGACGT GACCGAGGCC
AAGCGGTCTG CCCTTGAACT GGAGCAGGCC AACACCGAGC TGAAGGAAAC CCAGTCCCAG
ATGCTCCAGA GGGAGAAAAT GGCTTCCATC GGCCAGTTGG CGGCCGGCGT GGCCCACGAG
ATCAACAATC CCATGGGGTT CATTACCAGC AACCTGGGAA CGCTGCGCAA GTATGGGGAC
AAGTTGTTGG AATTCATGGC GTTTATTGCA GAGAAAGCGG AAAGTCACCT TGATCTGCGG
AAGGAAATCG AGGCTGAGCG GCGAAGGCTC AAAATCGACT ACGTGGCCGA CGATCTGGAG
AACCTCATCA CTGAGTCATT GGAGGGGGCC GAGAGGGTCA GAAAGATTGT TGCCGATCTC
AAGAGTTTTT CCCGGGTCGA CGAGGCTGAA TACAAGGTGG TTGACTTGAC CGAGTGTCTC
GACAGCACCA TCAACATCGT CTGGAACGAG CTGAAGTACA AGGCAACCCT GAAAAAGGAG
TACGGAGAAC TGCCGCCGCT CCGCTGCTAC CCCCAGCAGC TTAACCAGGT CTTCATGAAT
CTGCTCGTAA ATGCGGCCCA CGCCATCGAA ACCCAGGGAG AGATCACCGT CCGGACCAGG
TCGAAAGAGG GGTGGGTCAG CATCGCCATC GAGGATAACG GCTGCGGGAT TCCCGACGAT
GTCATGGCGA GGATCTTCGA GCCGTTTTTT ACCACCAAAG AGGTCGGAAG GGGGACGGGG
CTCGGGCTCT CCATCAGCTA CGACATAGTG AAGAAGCACG GCGGTGAGAT CACTGTGACC
AGCGAACAGG GGAAGGGAAC CACCTTCGTC GTCCGTCTGC CGTCATGCGC TGCTGGCCAG
GGGTAG
 
Protein sequence
MSIKTRMSVI ACCLVAGFLV LTAVPAYIYI SSELKRQISS EQFALARYMA AHFDDNVDMA 
REKIAKCAAA LNPAEVNDPA GLREFLEFEQ ELQTFFDQGM YVIRLDGTLV AATSRDAALR
KISPDEIDLL VRGKAVSNAC ASPHGPIPCL TVAAPIRDRS GSLVAVLAGR LDVTKKHFLG
ELTALKVGSD GYAYVIDSRR TLIMHTDRIR ILETLHVGAN EGIEKALHGF EGTVENTNAS
GVRGLTSFKR LSSAGWILAV HLPLQEAYDS LYRVRFYFGI MLAAAMFCSA MVVWFVMGRV
VAPLLSLTRH VRDMAEKNGA DRLVRVDTPD EIGDLARAFN GMVRKIEQQR GELERAKDFH
LLLFEEFPTL IWRMGLDGRC NYLNRTWFEF TGRRSAEALG DGWLEDIHPD DRPEVEHCFA
KAFAARSAYV MEYRLLHRNG EFRWIEDNGR PFNDLDGNFA GYIGFSLDIT ERMRAVEALR
ESERFARSTV DALSSHIAIL DATGTIVAVN KAWREFVLAN PPLSNNVNEG ANYLAVCDAA
VGEDAETAQS FAQGIRNVLE GKAAVFSREY SCHSPEEKCW FVGRVTRFSG DGPPLVVVAH
ENVTVRKLAE EELARSREEL VAKHKELKSA FLQVAQGKKE WELTLDCIGD LVIMADAAGR
VRRCNRAVTE LARKPVDELI GRPWPEVVIL PEVETDSYAN GGGEIHHRET NRWFLLTSYP
FTGTGTEKGA VITLHDVTEA KRSALELEQA NTELKETQSQ MLQREKMASI GQLAAGVAHE
INNPMGFITS NLGTLRKYGD KLLEFMAFIA EKAESHLDLR KEIEAERRRL KIDYVADDLE
NLITESLEGA ERVRKIVADL KSFSRVDEAE YKVVDLTECL DSTINIVWNE LKYKATLKKE
YGELPPLRCY PQQLNQVFMN LLVNAAHAIE TQGEITVRTR SKEGWVSIAI EDNGCGIPDD
VMARIFEPFF TTKEVGRGTG LGLSISYDIV KKHGGEITVT SEQGKGTTFV VRLPSCAAGQ
G