Gene MCA2331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA2331 
Symbol 
ID3102419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp2525693 
End bp2527093 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content66% 
IMG OID637171474 
Productsensor histidine kinase 
Protein accessionYP_114747 
Protein GI53803481 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR01386] heavy metal sensor kinase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATCCTCG GTTTACGGGC ACGTCTGATC CTCGTCCACC TGGCCGTCGT CGTCATCGTG 
CTGGCCTGCT CCGCCGCCGG CGCCTACTGG ATGCTGCGCC AGGCGGTCCA CAGTCGGCTC
GACGCCGCGC TGCTGGCCCT TGCGGAAACC GAACGGGCGA TGCTGCTGGA AGGCGGAGAA
CAGTCGATCA AGATCCATGA AGCCGCCCCC GGCCCCGCGC CTCCCTCCTT CATCCGTCTC
GACCGGCTGG TGCAGATTGT CGACGGCGAC GGCCGCGTTC TGGCACGTAG CGCAAACCTC
GGCACCGCTC GGCTTCCCTC GCCGCCCGGC CTGCTGTCGC GCCTGAGCCG GGGGGAAACG
GTCTTCGACA CCCTCCCGGC GTTCGGCGAG GAACCGGTTC GAATGGTATC CATTCCGGTA
CGGAAGGACG GTTCGCGACT GGCGATCCAG GTTGCCGGAT CACTGGATGA CGCAAGAAAC
GTCATGAAAT CGGCCGGCCT GTTGTTCATC GTCATGACCT CCGGTTTGCT GGTGGCGGTA
GGCATTGCCG GCACATCGCT GACTCGCAGG GCATTCCACG CGATCGACGA GGTCGTCCGT
CAAGCCCGCC GGATCGGTGA GGCCAACCTC GGCGAACGTC TCCCTCATCC GGGCAGCCGC
GACGAAATCG GCAGACTGGT CGACACGCTG AACGAGATGC TCGGTCGTCT GGAGCGGAGC
TTCGAGGTCC AGCGCCGCTT CACGGCTGAC GCCTCCCACG AACTTCGCTC GCCTTTGTCC
CGACTGCGCG CGGAACTCGA AATCACCCTC CGTCGCCCCC GGCGCCCGGA CGAATATGAA
CGGACCCTGC ACTCCTGCCT CGAGGAAGTG GAACGGCTGA CCCAACTGGT GGAAGCACTG
CTCGAATTGG CACGGCTCGA CGCACAGCAG GAAGCCATTC CCGGCGAGAA CGTTTGCCTG
AACACGGTTC TGGCCGAAGT CGTGCGTCGC CAGCAACCGC TGGCGCAGGA ACGCTCGATC
GGGATCGTCG TCGAAACCTC GCAACCCGTG ACCGCCTGGG TGTCCGGCGC TTCGATCGGC
GTCGTTTTCT CCAACGTCCT GGAGAACGCC TTGAAGTTCT CGCCACCCGG CGGAACCGTC
CACATCGGAC TGGCTGCCGA CGCACGGGAG GCGGTAGTGA GCATTTGCGA CAACGGACCG
GGAATCGAGC CCGGCGAAAA AGACCGGCTG TTCGAGCGTT TTTTCCGGGG TTCGGTCGCC
CGCTCCGGTG CTATGCCCGG AACCGGCCTG GGCCTCGCGC TGTCACAGGC CCTCGTCCGG
CATTGCGGGG GAACCATCGA AGCCACGAAT ATCCCGGGGG GCGGCGCCCG GTTCACGATA
CGCCTGCCAT TGGGACCCTG A
 
Protein sequence
MILGLRARLI LVHLAVVVIV LACSAAGAYW MLRQAVHSRL DAALLALAET ERAMLLEGGE 
QSIKIHEAAP GPAPPSFIRL DRLVQIVDGD GRVLARSANL GTARLPSPPG LLSRLSRGET
VFDTLPAFGE EPVRMVSIPV RKDGSRLAIQ VAGSLDDARN VMKSAGLLFI VMTSGLLVAV
GIAGTSLTRR AFHAIDEVVR QARRIGEANL GERLPHPGSR DEIGRLVDTL NEMLGRLERS
FEVQRRFTAD ASHELRSPLS RLRAELEITL RRPRRPDEYE RTLHSCLEEV ERLTQLVEAL
LELARLDAQQ EAIPGENVCL NTVLAEVVRR QQPLAQERSI GIVVETSQPV TAWVSGASIG
VVFSNVLENA LKFSPPGGTV HIGLAADARE AVVSICDNGP GIEPGEKDRL FERFFRGSVA
RSGAMPGTGL GLALSQALVR HCGGTIEATN IPGGGARFTI RLPLGP