Gene GM21_3930 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3930 
Symbol 
ID8139304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4514665 
End bp4515693 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content64% 
IMG OID644871547 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_003023705 
Protein GI253702516 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value0.0182336 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGAGG AACCTGAAAA AATCGCGGCC ATGCTGCTGG AGCGGATCGA CTGCGGCGTC 
GTGCATCTGG ACGACACCGG AAAGGTGCTC CTCGTCAACA GCAAGGCGGA AGAATTGCTT
CACGTCCAGC GCGGCCAGGT GCTGGGGCGC AGGGTCGACA TGCTCCCCTT GCGCACCCCG
CTTTACCGTG TGCTGAGCGA GGACTCGCGG GACGCCCCGG TGGAGATGAG CTTGGAAGGG
ACGGTGGTCC AGGTCCGCTC CTTCGGCCTG CCGGCCGAGT GCGGCGGAGG GGAGCTGTAC
CAGCTTCGCG ACGTAACCGC TGAAAGAAAG GAGAGGCGGC AGCGCGAGGA GTTCGTGGCC
ATGATGACCC ACGACCTCAA GTCGCCGCTC ACCGTGATCA TGGGGTACAT CCAGGCCCTT
TTGGGGGAAA AGGCGAAGAT CGACCCCTCG CTGCACCTGT TCCTCGGGGA GATGGACAAA
AGTTCGGTGA AGATGCTCTC CATGATCGAC GACGTGCTGG ACGCCTACCG GCTGGAGGCG
GGCCTTTTGC AGATCGACCG CCGCCGCTGC GACATCCACC CCTTGCTGGA GGGTTGCAGC
CGCGACGGCG AGCGCGAGGC GGCGGTGCAC GGCTCCTGCT TCCAAAGCGA GCTCTGCGAC
GACATCCCCC CGCTGGAACT CGACGCGAAG CAGATCAGCC GGGTCTTCGC CAACCTGATC
GGCAACGCGG TGAAATTCAC CCCCAGGCGC GGCACCATCA CCTTCAGCAG CGAGGTCCGG
GACGGGTTTC TCCGGGTTCA GGTCGCCGAT ACCGGCATCG GGATTCCGCC CGAGGAGCTG
CCGCGGATCT TCAACCAGTA TTTCCGGGCC CAGTCGGCGC ACGGTTTCAA GGGGACGGGG
CTTGGCCTCA CCATCAGCAA GGCGATCGTG GAAGCCCACG GCGGCAGCAT CGGCGTGGAG
AGCACGGCCG GCAAGGGGAG CCGCTTCTCG GTCCTTTTGC CGCTGCAGGA GAAAAAGGAA
GTCATTTAA
 
Protein sequence
MQEEPEKIAA MLLERIDCGV VHLDDTGKVL LVNSKAEELL HVQRGQVLGR RVDMLPLRTP 
LYRVLSEDSR DAPVEMSLEG TVVQVRSFGL PAECGGGELY QLRDVTAERK ERRQREEFVA
MMTHDLKSPL TVIMGYIQAL LGEKAKIDPS LHLFLGEMDK SSVKMLSMID DVLDAYRLEA
GLLQIDRRRC DIHPLLEGCS RDGEREAAVH GSCFQSELCD DIPPLELDAK QISRVFANLI
GNAVKFTPRR GTITFSSEVR DGFLRVQVAD TGIGIPPEEL PRIFNQYFRA QSAHGFKGTG
LGLTISKAIV EAHGGSIGVE STAGKGSRFS VLLPLQEKKE VI