Gene M446_5940 details

Gene Information

Locus tag: M446_5940
Symbol:
ID: 6132492
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 6528456
End bp: 6530795
Gene Length: 2340 bp
Protein Length: 779 aa
Translation table: 11
GC content: 72%
IMG OID: 641646042
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_001772654
Protein GI: 170743999
COG category: [T] Signal transduction mechanisms
COG ID: [COG5000] Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation
TIGRFAM ID: [TIGR00229] PAS domain S-box
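
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. Both assumptions are inferred from the numbers in this record, not stated by it, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 6528456
END_BP = 6530795


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based coordinates with both ends included."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming the CDS ends with one untranslated stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2340, matching "Gene Length"
print(protein_length(length_bp))  # 779, matching "Protein Length"
```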


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.363869
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGCTGT CGCGCCGCGT TCACGCGCCC GGGGCCTCCC CCGGTGCCCC GCCGGAGACG 
GCCCCGCGGG GGCCGGGGCT GTTCGGGGCC CTGGTCGTCG GGGTGGCGCT GGCGCTCGCC
CTCGCCACCT TCCTGATCCT GGTCGGGGCC ACCGGCATCG CCCCGACCCA CGACGTCGTC
GTCGCGCTGC TGCTCGGCAA CGTCGCCCTG GTGATCGGCC TCGTGGTGGT GATCGCCTGG
GAGGCCCGGG TCTTCCTGCG GGCGCGGCAG GCCAATGCCC GGGTGGCGCG GCTGCACACC
CGCATCGTCG GCCTGTTCAG CCTCATCGCC ACCCTGCCCA CCATCCTGCT CGCGGTCGTG
GCCTCGATCA CCCTGGAGCG CGGCCTCTCG CCCTGGTTCT CCGACCGGAT GCGCAACGTC
GTGCTGATGT CCGTCGACGT CGCCGACGCC TACCTGACCA ACCAGTGCCA GAGCCTCGCC
CGCGAGGCCC GCATCCTCTC GGACGATCTC ACCCGGGCGC GCCCCGCCTT CGAGGTCGAG
CGGGCGTGGT TCGAGAACTT CCTCACCGCG CGCGCGAGCT CCCTCGGGCT GCCGATCGCC
CGCATCATGC GCTCCGCCGA CGAGACCGTG GCGCGCGCCA ACATCGACGT GCTCAAGAAC
CCGCCGCTGC CCTCCGCGGC CGATTTCGAG GAGGCCGCCA AGTCCAACGA CCCGACCTGC
CTGCTGCCCC GCGAGGGCCG CGTCTTCGGG GCCCTGATGA AGCTGCCGGC CTACGGCGAC
GCCTACCTGC TGATCGAGCG CGAGGTGACG CGCCTCGCCG TCGAGTTCCC GGGCGTCTCG
CGGGCGGCGG CGACCGAGTT CCTCACCATC GATGCCGGGC GGCGCAGCGT GCAGATCGCC
TTCGCCAGCA TGTTCGCCCT GATCGCCCTG ATCGCCCTGC TCTCGGCGGT CTGGTTCGGG
CTCAACTTCG CCAACCGGTT CGTCGCCCCG ATCCGCCGCC TCATCAACGC CGCCGACCAA
GTCGCCTCCG GCAACTTCTA CGCCCAGGTC CCGCACCGCA AGACCGAGGG GGACCTCCAG
CACCTGGGCG AGAGCTTCAA CAAGATGACC CAGGAGCTGC GCCGCCAGCA CGACGGGCTG
ATCGAGGCGC GCGACCAGAT CGACCGCCGC CGCCGCTTCA CCGAGGCGGT CCTCGCCGGC
GTGCCGGCCG GCGTGCTCGG GATCGACGCG GAGGGCGTCA TCACCATCGC CAATCCCTCG
GCCGAGCGGA TGCTCGGCCT CACCCCGCAG GAACTCGTCG GCACGCCCCT GCGGGTCGCG
GTGCCGGAAC TGGCGGGCCT CCTCGGCGCC TCCGGCGAGG GGCGCCTGCG CCCGCTCCAG
CAGCAGATCC AGATCACCCG CAAGGGCCGC GAGCGCACCA TCGACGTGCG GGTCACCAGC
GAGCAGGCGC AGGGCATCGA CCGCGGCCAC GTCGTCACCC TCGACGACAT CACGGACCTC
GTCGCCGCCC AGCGCACCTC CGCCTGGGCC GACGTCGCCC GGCGCATCGC CCACGAGATC
AAGAACCCGC TCACCCCCAT CCAGCTCTCG GCCGAGCGCA TCCGCCGCAA GTACGGCAAG
GTCATCACCA CCGACAAGGA CGTGTTCGAG CAGTGCACCG CCACGATCGT GCGCCAAGTC
GACGACATCA AGCGGATGGT CGACGAGTTC TCGTCGTTCG CCCGGATGCC CAAGCCGGCG
ATCTCCCGCA ACGACCTGAC CGAGATCGTG AAGCAGAACC TGTTCATGAT GCGGGTGGCG
CATCCCGACA TCGATTTCGC GATGGAGGGG GAGGGCACCC GGATCAGCGC CGCCTTCGAC
ACGCGCCTCC TCTCCCAGGC GGTCACCAAC ATCCTCAAGA ACGCCGTCGA GGCGGTCCAG
GCGGTGCCGG AGGCGGAGCG CGGCCGCGGG CGGATCGCCG TGCGCCTCCT GGAGGAGGGG
GACGGCGCCG TGATCGAGAT CACCGACAAC GGCAAGGGTT TTCCGGCCGA AGGGCGCCAG
AGGCTGCTCG AGCCCTATAT GACGACCCGC GAGGGGGGGA CCGGCCTCGG GCTGGCGATC
GTGAGCAAGG TCTTGGAAGA GCACGGCGGC GGCATCGATC TCAACGACAA TCCGGCGGGG
CGGGGCGGGC AGGTGCGCAT GCGCCTCCTG CGCGAGGTGC CGCGCCCCGC CTCGCCCGAG
GGCGCGGCCG CCCCGGCCCA GAGCGCGGCG GCCCAAGGCG CGGCCGCCCA AGGCGCGGCG
GGGCAGGGCG CGGCGGGGCA GGACGCCCCG GCGCCGAGCG CGCAGGGGGC CGCACCGTGA
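
The gene sequence is displayed in blocks of 10 bases, 60 per line, so the whitespace has to be removed before any analysis. As a rough sketch of how the GC content reported for this gene could be recomputed, the snippet below cleans a pasted sequence and counts G and C bases. Only the first displayed line is embedded here to keep the example short, so the printed value describes that 60-base fragment and not the full gene. The helper names are hypothetical.

```python
# First displayed line of the gene sequence (a 60-base fragment, not the full gene).
FIRST_LINE = "ATGTTGCTGT CGCGCCGCGT TCACGCGCCC GGGGCCTCCC CCGGTGCCCC GCCGGAGACG"


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of bases that are G or C."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


fragment = clean_sequence(FIRST_LINE)
print(len(fragment), round(gc_percent(fragment), 1))
```

Passing all of the sequence lines to `clean_sequence` in place of `FIRST_LINE` would give the figure for the whole gene.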
 
Protein sequence
MLLSRRVHAP GASPGAPPET APRGPGLFGA LVVGVALALA LATFLILVGA TGIAPTHDVV 
VALLLGNVAL VIGLVVVIAW EARVFLRARQ ANARVARLHT RIVGLFSLIA TLPTILLAVV
ASITLERGLS PWFSDRMRNV VLMSVDVADA YLTNQCQSLA REARILSDDL TRARPAFEVE
RAWFENFLTA RASSLGLPIA RIMRSADETV ARANIDVLKN PPLPSAADFE EAAKSNDPTC
LLPREGRVFG ALMKLPAYGD AYLLIEREVT RLAVEFPGVS RAAATEFLTI DAGRRSVQIA
FASMFALIAL IALLSAVWFG LNFANRFVAP IRRLINAADQ VASGNFYAQV PHRKTEGDLQ
HLGESFNKMT QELRRQHDGL IEARDQIDRR RRFTEAVLAG VPAGVLGIDA EGVITIANPS
AERMLGLTPQ ELVGTPLRVA VPELAGLLGA SGEGRLRPLQ QQIQITRKGR ERTIDVRVTS
EQAQGIDRGH VVTLDDITDL VAAQRTSAWA DVARRIAHEI KNPLTPIQLS AERIRRKYGK
VITTDKDVFE QCTATIVRQV DDIKRMVDEF SSFARMPKPA ISRNDLTEIV KQNLFMMRVA
HPDIDFAMEG EGTRISAAFD TRLLSQAVTN ILKNAVEAVQ AVPEAERGRG RIAVRLLEEG
DGAVIEITDN GKGFPAEGRQ RLLEPYMTTR EGGTGLGLAI VSKVLEEHGG GIDLNDNPAG
RGGQVRMRLL REVPRPASPE GAAAPAQSAA AQGAAAQGAA GQGAAGQDAP APSAQGAAP
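
The protein sequence can be checked against the gene sequence by translating codons. The sketch below builds the codon table from the usual TCAG ordering. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts. This gene begins with ATG, so alternative start codons are not handled here, which is a simplification. Only the first 30 and last 60 bases of the gene are embedded, and the names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    sequence = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(sequence) - 2, 3):
        aa = CODON_TABLE[sequence[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 60 bases of the gene sequence (both in frame).
GENE_START = "ATGTTGCTGT CGCGCCGCGT TCACGCGCCC"
GENE_END = "GGGCAGGGCG CGGCGGGGCA GGACGCCCCG GCGCCGAGCG CGCAGGGGGC CGCACCGTGA"

print(translate(GENE_START))  # MLLSRRVHAP
print(translate(GENE_END))    # GQGAAGQDAPAPSAQGAAP
```

Both outputs match the corresponding ends of the protein sequence listed above, and the final TGA codon is the stop that accounts for the protein being one residue shorter than the codon count.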