Gene M446_5104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5104 
Symbol 
ID6132374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5605058 
End bp5606188 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content77% 
IMG OID641645239 
Productsignal transduction histidine kinase, nitrogen specific, NtrB 
Protein accessionYP_001771864 
Protein GI170743209 
COG category[T] Signal transduction mechanisms 
COG ID[COG3852] Signal transduction histidine kinase, nitrogen specific 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0174826 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGGATC TGCTCGCGCC CGAGGCTGCG CCGGAGGCCC CTTTCGCCCT GATATGGACC 
GCGCTCGGCG CCTCGCCCCT GCCGTCGCTG CTGCTGGGCC GCGACGGGGC GATCCGCCGG
GCGAACGCGG CCGCCGCCCG CGCCCTCGGC GAGGAGCCGG AGGCCCTCGC CGGGCAGGCC
CTCGCGGCCC TCGGCGCCGA CGCGGAGAAT GCGGGCGCCC TGCGGGCGCT CCTGCGCGGC
GACGCGGCCG CGAGCGCGGA CCTGCTCCTG CGCCGCCGCG ACGGGTGCAG CTTCTGGGCC
TCGCTCCACC TCTCGCCCAT CGACGGCGCG CCGGACCTGC GCCTCGTCCA GTGGCTCGAC
GTCTCCCGCC GCCGCGACCT CGAATCCGCC CTCGCCCAGG CGCAGCGGCG CGAGGCCCTG
GGCCAGCTCA CCAACGGCGT CGCCCACGAG TTCAACAACC TGCTGCAGAT CCTGGTCGGC
TACGTCGACG GCCTGAAGCG CCGCCTCGGC GAGCACCCGG ACCCCTTCGT GCAGCGCGCG
CTCACCCGGG CCACCGACGC GGCCGAGCGC GCCTCGGCCC TGACCCGCCA CCTCCTCGCC
TTCTCGCGCA AGCACCGGCC GGAGCCGCGC GCGACCGACC TCAACGCCCT CCTGCGCGGC
TGCGAGCCCC GCGCCCGCGC GATCCTGGGC GAGGGCGTCG CGCTCACCCT CGCCCTCGAC
GAGGCGCTGT GGCCGGCCTG CCTCGACCCG GTCCAGACCG AGTTCATCCT CGCGGTCGTG
CTCACCAACG CCCGCGAGGC GATGCCGGAG GGCGGCCGCG TCACCCTGAC CACCGCCAAC
CACGGCGGCG AGGGCGTCGC GGCGGACGGC ACCCTCGGCC GCCACGTGGT GCTGACCATC
GCCGATACGG GCAGCGGCAT GCCCCCCGAG GTGCTGGCCC GCGCCCTCGA ACCCTTCTTC
ACCACCCGCG AGCCCGGCCG CGGCACCGGG CTCGCCATCC TCCACGCGCT GATGAAGCGC
CAGGGCGGCG CGGTCGGGCT GCGCAGCACC CTCGGCGAGG GCACGATTCT GCGCCTCAGC
TTCCCGGCCG CCGATCCGCC CCGCCCGCCC CGTCCCGGGG CCGCGGCCTG A
 
Protein sequence
MSDLLAPEAA PEAPFALIWT ALGASPLPSL LLGRDGAIRR ANAAAARALG EEPEALAGQA 
LAALGADAEN AGALRALLRG DAAASADLLL RRRDGCSFWA SLHLSPIDGA PDLRLVQWLD
VSRRRDLESA LAQAQRREAL GQLTNGVAHE FNNLLQILVG YVDGLKRRLG EHPDPFVQRA
LTRATDAAER ASALTRHLLA FSRKHRPEPR ATDLNALLRG CEPRARAILG EGVALTLALD
EALWPACLDP VQTEFILAVV LTNAREAMPE GGRVTLTTAN HGGEGVAADG TLGRHVVLTI
ADTGSGMPPE VLARALEPFF TTREPGRGTG LAILHALMKR QGGAVGLRST LGEGTILRLS
FPAADPPRPP RPGAAA