Gene GM21_3104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3104 
Symbol 
ID8138454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3597010 
End bp3598467 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content65% 
IMG OID644870708 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_003022890 
Protein GI253701701 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones148 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCACGAA AAACGACCCC GCCGGCCAAG GCCGGACCAG AGCCCAACAT CCCGGACCCG 
AACCTGCTCC TGAGCATCGT GGGGAGCCTT TCCAACGGCA TCTACGTGAT CCAGGACGGC
CGCACCATCT TCCTGAACGA GCGCTTCGCG GAGATCTTCG GCTACCCGGC AGTTGCGCAG
CTGATCGGGG AACGGATGTA CGACACCGTC TACCCCGACA GGCAGAGCGT GGAGCTCTTC
AGGTCGGTGA ACGAGAAGGT GATGGCCGGA GGCTCCCAGA ACACCTCATG GGGGCAGCCC
TGCAAAAGAA CCGACGGCAC CACCTTCTGG CTCGAGGTGG AGGCAAGGCG GATCGAAGTG
GACGGGCGCC CCGCGGTCCT CGGCACCTTT CTCGATCACA CCGACTGCAA GCTCCTGGGG
CAGGCGATGC ACGCCTCCCA GGCGACGCTG CACATGCTCC TGGACGCCAT GGAGGACCGG
GTCTACGTGG TCACCGACAA GTTCCACATC GTCTACGCCA ACAGAAAGAT GCAGGAGGGG
ATGAGCGGCG ACATAACCGA CGAGCTTTGC TACCGGGTCT GCCGCGGCCT GTCCGAGCGC
TGCAGCGACT GCTCAAGCGA CGAGGTCTTC GCCACCGGAA AGCCGGTCTA CAAGGAGTTT
TTCAACGAGC GGACCCAGCG CTGGTACTCC GTCATCGAAC TCGCCGTGCG CATGCCCGGC
ATCGAGCGCC CGACCAAACT CGCCGTCGCC CGCGACATCA CGGTGCGAAA GGAGGCCGAG
GAGAAGGTGC GGGCCCTGTC GCACCGGCTT TTCAGCGCCC AGGAGGACGA GAGAAGGCAC
CTGTCGCGGG AGCTTCACGA CGATCTCGGG CAGCGCCTGA ACGCGGTGAA GATCGGCGTG
GACACCCTGG CCTGCGACCT GCCGCAGCTG CCGCCGGACT TCCTGTCCCG GGTGGGCTAC
CTCTCGGAGA TACTGCAGTC GACCATCCAG TCGGTGCGCG ACATCTGCTC GGGGCTGCGT
CCCTCTTCGC TGGAGCGGCG CGGCCTGGTG CAGGCGATCA CCAGCGACTG CGCGGAGCTT
GCCTCCGTGC ACGGGCTGAA GATAGACGTG AAATCCAGCG GCATGAAGAA GGTGCTGCTG
GACAACGAGG CGGAGATCAA CCTTTACCGG GTCTTTCAGG AGTCGCTGCA CAACGTGGTG
AAGCACGCCC AGGCAAGCCG CGTCTCGGTG AGCCTCATCG CCTCCCATCC GATCGTCAGG
ATGCGGATCG AGGACGACGG CAAGGGTTTC GCCCCTTGCG CGAGGGTGGA CGGCGGCGGA
GGGGAGCGGG GGCTCGGCCT GGTGAGCATC GCGGAACGGG TCGATCTCAT GGGGGGGAGC
CTGGATGTGG TGTCGCGCCC GGGGGTCGGG ACCCGGATCG TGGTGGAGAT CCCGGCCAGC
GGCTATGGAA TTCAATAG
 
Protein sequence
MPRKTTPPAK AGPEPNIPDP NLLLSIVGSL SNGIYVIQDG RTIFLNERFA EIFGYPAVAQ 
LIGERMYDTV YPDRQSVELF RSVNEKVMAG GSQNTSWGQP CKRTDGTTFW LEVEARRIEV
DGRPAVLGTF LDHTDCKLLG QAMHASQATL HMLLDAMEDR VYVVTDKFHI VYANRKMQEG
MSGDITDELC YRVCRGLSER CSDCSSDEVF ATGKPVYKEF FNERTQRWYS VIELAVRMPG
IERPTKLAVA RDITVRKEAE EKVRALSHRL FSAQEDERRH LSRELHDDLG QRLNAVKIGV
DTLACDLPQL PPDFLSRVGY LSEILQSTIQ SVRDICSGLR PSSLERRGLV QAITSDCAEL
ASVHGLKIDV KSSGMKKVLL DNEAEINLYR VFQESLHNVV KHAQASRVSV SLIASHPIVR
MRIEDDGKGF APCARVDGGG GERGLGLVSI AERVDLMGGS LDVVSRPGVG TRIVVEIPAS
GYGIQ