Gene M446_4771 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_4771 
Symbol 
ID6134802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5245422 
End bp5247296 
Gene Length1875 bp 
Protein Length624 aa 
Translation table11 
GC content79% 
IMG OID641644908 
Producthistidine kinase 
Protein accessionYP_001771535 
Protein GI170742880 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.15318 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.415438 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGGCG AACCCGCGAA CGGCCCCGTT CCGGCCGGCC GCACGGCGGG GCCGGCCGCC 
TTGCGGGATT CCGGACAGCC CGGCCGGGTC CTGCTCTGGG CGGCGGCCCT CCTGGCCGTC
GGTCTCGCCG CCTGGCTCGG CGGCCACGCG GCGGAGCGGG CCGCCCTGGC GGATCTGCGC
CGCGCCGCCG AGGCCGGGCT GACCCTCCAC GTGGCGGCGC TGCGGGCGGA ACTCCAGAAG
CAATCCTCCC TGCCGCTCGC GCTGGCGGCG GATCCGGAGA TTGCCGCGAG CCTCGCCCGC
GCCGCGCCGC CCGGGCTGAC CGCCCAGGTC AGCGGCCGGC TCGCCGGGAT CGCCGAGGCG
ACGGGCGCGG CCGCGATCTA CGTGGTGCGG GCGGACGGGC TCACGGTGGC GGCGAGCAAC
GCCGGCACGC CCACCAGCTT CGTGGGCAAC GACTACACGT TCCGCCCGTA TTTCCGCGAG
GCGCTGGCGG CCGGGTCGGG CGCGCAATTC GCCCTCGGCA CGGTGAGCGG CCGGCCCGGC
CTGTTCCTGT CGCGGCGCGT CGCGGGCGGC GGCGGGGTGG TCGTGGTCAA GGTCGAGTTC
GAGGCGGTCG AGGCGGGGTG GCGCGGCGGG GGCGCCGAGG TCTTCGTCAC CGATCCCCGC
GGCATCGTCC TGGTGGCGAG CGACCCCGCC CGGCGCTTCC GCACCCTCGG CCCGATCTCC
GAGGAGGAGC GGCGGCGGAT CCGCGAGGGG CTCGAATTCG GCGATGCGCC GCTCTCGCCG
CTGCCCTTCC ATCCGGGTCC GGAGGACGGG CTCCTGCGGG TGGCGGGCGG GGCCGCGCCG
GCGCGGCTCG TCCTGCCGGT CGACGCGCCC GTCCCGGGGA CGCGCTGGCG GCTCCACACC
CTCACCCCGG TCGGGGCGGC GGTGAGGCGC GAGCGGCTCC AGGCCCAGGT GCTGGCCGGG
CTGGTGGCGG CGGCGTCCTG CCTCGGCCTC GTCCTCCTGC GCCAGCGCCG CCAGCGCGTG
CGGGCCCGCC TCGCCGAGGA GGCGGCCCGC CGGGCGGAGC TGGAGGCGGC GGTGGCCGAG
CGCACCCGCG CGCTGAGCGA GGCGAACGCG CAGCTGCGGG AGGAGATGGC GGAGCGCCAG
CGCGCCGAGG CGGAGCGCGA GCGCCTCGGG CGCGACCTCG CCCACGCGGC GCGGCTCGCC
GCCCTCGGCC AGTTCGCCGC CAGCATGGCG CACGAGATCA ACCAGCCGCT CGCGGCGATC
CGCTCCTACG CGGACAACGC CGCGATCCTG ATCGGCCGCG GCCGGGCCGG GGAGGCGGCC
GAGAATTTCT CCGCCATCGG CCGGCTGACC GAGCGGATCG CCGGCCTGAC GCGCCAGCTC
AAGGGCTTCG CCCGGCGCGC CTCGGGCCGG CGCGAGCCGG TGCGGCTCGC GCGGGTGCTG
GCCCAGGCGG TCGAGATCGT CGCGAGCCAG GCGGCCGAGC GCGGCGTCGC CCTCGCGGTG
GAGGCCGGCG GGCCGGACCT GATCGTCCTC GGCGACGAGG CGCGGCTGGA GCAGGTGGTG
GTGAACCTGA TCCAGAACGC CCTCGACGCC GTGGCGGGCC GCCCCGATCC GCGGGTGAGC
GTGCGCTGCG CCGCGTCCGG CCCGCGCGCC GTGCTGGAGG TGTCCGACAA CGGCCCCGGC
CTGCCGGAGG GCGAGGCGGG CCGGGTGTTC GATCCCTTCT TCACCACCAA GCCGCAGGGG
CTCGGGCTCG GCCTCGCGAT CGCCCGCGGC ATCGTCGAGG AATGCGGCGG CACGCTCGCG
GCCGCGCGGA GCGAGGCGGG GGGCGCGCTG TTCCGCGTCG ACCTCGCCCG CGCAGAGGTG
GCGGAGGCGG CATGA
 
Protein sequence
MPGEPANGPV PAGRTAGPAA LRDSGQPGRV LLWAAALLAV GLAAWLGGHA AERAALADLR 
RAAEAGLTLH VAALRAELQK QSSLPLALAA DPEIAASLAR AAPPGLTAQV SGRLAGIAEA
TGAAAIYVVR ADGLTVAASN AGTPTSFVGN DYTFRPYFRE ALAAGSGAQF ALGTVSGRPG
LFLSRRVAGG GGVVVVKVEF EAVEAGWRGG GAEVFVTDPR GIVLVASDPA RRFRTLGPIS
EEERRRIREG LEFGDAPLSP LPFHPGPEDG LLRVAGGAAP ARLVLPVDAP VPGTRWRLHT
LTPVGAAVRR ERLQAQVLAG LVAAASCLGL VLLRQRRQRV RARLAEEAAR RAELEAAVAE
RTRALSEANA QLREEMAERQ RAEAERERLG RDLAHAARLA ALGQFAASMA HEINQPLAAI
RSYADNAAIL IGRGRAGEAA ENFSAIGRLT ERIAGLTRQL KGFARRASGR REPVRLARVL
AQAVEIVASQ AAERGVALAV EAGGPDLIVL GDEARLEQVV VNLIQNALDA VAGRPDPRVS
VRCAASGPRA VLEVSDNGPG LPEGEAGRVF DPFFTTKPQG LGLGLAIARG IVEECGGTLA
AARSEAGGAL FRVDLARAEV AEAA