Gene GYMC61_2789 details


Gene Information

Locus tag: GYMC61_2789
Symbol:
ID: 8526666
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 2843052
End bp: 2844662
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 50%
IMG OID:
Product: signal transduction histidine kinase regulating citrate/malate metabolism
Protein accession: YP_003253851
Protein GI: 261420169
COG category:
COG ID:
TIGRFAM ID:
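
The length fields follow from the coordinates: with 1-based inclusive coordinates, the span from Start bp to End bp covers End - Start + 1 bases, and a CDS whose last codon is a stop encodes one amino acid fewer than it has codons. A minimal Python sketch of that arithmetic, assuming inclusive coordinates and a terminal stop codon (the function names are illustrative and not part of the database):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(2843052, 2844662)
print(length_bp)                  # 1611
print(protein_length(length_bp))  # 536
```

Both values agree with the Gene Length and Protein Length fields listed for this gene.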


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCTTC AGACACGGTT AATGGTCATT ATTTGTTCAT TGTTGCTGCT GGTGATCGTG 
GTTTTAACGT TTTTGTTTCA GCATATGTTT GCGACGACGC TCAAACAACA AATCGGCATG
CGAGCGCTGA ATGTGGCGGA AACCGTCGCG TCCACTCCGC TCGTGCGCGA GGCGTTCCGC
GACCCGAATC CGTCAGTGCG CTTGCAGCCG TTTGCCGAAC ATATTCGCCG AAAGACAGGG
GCGGAATATG TAGTGATTGG CAACCATGAG GGAATCCGCT ACGCTCACCC ACTGCCAGAT
CGGATTGGAA AGCATATGGT AGGGGGCGAC AACGGCGAAG TGCTGAAAGG CAAAGCGATT
ATTTCCGAGG CGGTCGGTTC GCTTGGCCCG GCGATTCGCG GAAAAGCGCC GATTTTTGAT
GAAAACGGAC ATGTGATCGG CATTGTTTCG GTTGGTTTTT TGCTAGAGGA TATTCAGCGT
ACAGTATGGT CGTATCGTAT GAAGATGTTT CTCTTCTCCG TTTTTGCCCT CTTTCTCGGA
GCGGTGGGTG CAATGGCGAT CGCCAGCACG GTGAAAAAAT CGATTCATGG TCTTGAACCG
GAAGAAATCG GTTTGCTCTA CCAAGAAAAG CAGGCGATTT TAGAAGCGAT TCGCGAAGGA
ATTGTGGCCA TCAATCAAGA GGGAACGATT ACGATGGTCA ATCAAACCGC GTTGAAGCTG
CTTGGGTATG AGAACGAACG CGATGTACTA GGAATGCCGA TCTTACAGCT TATTCCCCAC
TCGAGGCTGC CTGAAGTCAT TCGGACGGGG CAGGCGGAAT TTGACGATGA AATGGTGCTA
GGTGAAGAAA CGGTTATTGC CAATCGCATC CCGATTAAAG ATAAAACGGG CGGCGTGATC
GGAGCGGTGT CGACGTTTCG CAATAAATCA GAACTGTATC GCCTGACAAA AGAGCTGTCA
CAGTTGCGAA GCTATGCCGA TGCTTTGCGC GCGCAAACAC ACGAATTTTC CAACAAGCTG
TATTTGATCT CCGGTCTTAT TCAACTGGAA TCGTATGAGG AAGCGCTTGA ACTGATTGCA
AAAGAAACCG ACTTGCAACA AAACATCGTC CGGTTTGTGA TGAAGGAAAT TCCCGATCCG
ATCATTGGCG GGCTGCTGAT CGGCAAGTTC AATCGCGCAA ACGAGCTGAA AATTGTGTTT
GAAATTGATC GGGAAAGCAG TTTCCGCGAT GTGCCTCCAT GGATGGACCG GGACCATCTG
GTAACCATTA TTGGAAATGT AGTAGATAAT GCCATGGAAG CGGTCCTTCA TAACGGAAAG
GAAGAGAAAC GAGTGGCCAT TTTTTTGACT GACCTCGGCG ATGATTTGAT TATTGAGGTC
GAGGATAACG GATTAGGCAT TGACCCAGTT GTAGCGGAAC GGATCTACGA CCGCGGCTTT
TCAACGAAAG CTGCCCGAGG ACGGGGGTAT GGGCTTGATC TAGTGAACCG GGCGTTGACA
ATGCTTGGGG GGCAAATGAC GTATCGATCT GAGCAAGGAG CAGGCACGGT ATTTACCATC
ATGATCCCAA AGCGACCGGC CCATGTCGGT CGGCAGTGCC GCCAAGGCTA G
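
The GC content field is the share of G and C bases in the gene sequence. A minimal Python sketch of that calculation, assuming the sequence is pasted as whitespace-separated blocks as on this page (`clean_sequence` and `gc_percent` are illustrative names). Only the first line of the sequence is used here, so the printed percentage is not expected to match the whole-gene figure:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence only (an excerpt, not the full gene).
first_line = "ATGAAGCTTC AGACACGGTT AATGGTCATT ATTTGTTCAT TGTTGCTGCT GGTGATCGTG"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_percent(seq), 1))   # GC percentage of the excerpt
```

Passing the full gene sequence through `clean_sequence` first gives the value to compare against the GC content field.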
 
Protein sequence
MKLQTRLMVI ICSLLLLVIV VLTFLFQHMF ATTLKQQIGM RALNVAETVA STPLVREAFR 
DPNPSVRLQP FAEHIRRKTG AEYVVIGNHE GIRYAHPLPD RIGKHMVGGD NGEVLKGKAI
ISEAVGSLGP AIRGKAPIFD ENGHVIGIVS VGFLLEDIQR TVWSYRMKMF LFSVFALFLG
AVGAMAIAST VKKSIHGLEP EEIGLLYQEK QAILEAIREG IVAINQEGTI TMVNQTALKL
LGYENERDVL GMPILQLIPH SRLPEVIRTG QAEFDDEMVL GEETVIANRI PIKDKTGGVI
GAVSTFRNKS ELYRLTKELS QLRSYADALR AQTHEFSNKL YLISGLIQLE SYEEALELIA
KETDLQQNIV RFVMKEIPDP IIGGLLIGKF NRANELKIVF EIDRESSFRD VPPWMDRDHL
VTIIGNVVDN AMEAVLHNGK EEKRVAIFLT DLGDDLIIEV EDNGLGIDPV VAERIYDRGF
STKAARGRGY GLDLVNRALT MLGGQMTYRS EQGAGTVFTI MIPKRPAHVG RQCRQG
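
The protein sequence corresponds to the gene sequence read codon by codon with translation table 11. A minimal Python sketch, assuming the codon assignments of NCBI table 11 (these match the standard code; the tables differ in permitted start codons, which this sketch does not handle) and using illustrative names:

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Whitespace is ignored; codons with ambiguous bases raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, then its last 9 bases.
print(translate("ATGAAGCTTC AGACACGGTT AATGGTCATT"))  # MKLQTRLMVI
print(translate("CAAGGCTAG"))                         # QG
```

The two outputs match the first ten and last two residues of the protein sequence listed above.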