Gene M446_3179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3179 
Symbol 
ID6131560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3518011 
End bp3519180 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content73% 
IMG OID641643367 
ProductCBS domain-containing protein 
Protein accessionYP_001770019 
Protein GI170741364 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0353609 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0370442 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACG ATCGAAGTCG TGGCGCCGCC GCGACCGCCC AGCCCGCCCC GGACGAGGAC 
AGTCCGGCCC GTGAGCCGTG GTATGACCGT CTCCTCACCA TCTTCCACCT GAAGCCGCGG
GAGGCTCCGC GCGACGAGAT CACCGACGCG CTGGCGGAGG CGCAGTCGGG CGATCACGCC
TTCTCGCCCG TCGAGCGGGC GATGCTGAAG AACGTGCTGA GCCTGCACCG GGTGCGGGTC
GACGACGTGA TGGTGCCGCG GGCCGACATC GTCGCCGTGC CGGCGGAGAT CTCCCTCGGC
GAACTCCTGA AGGTGTTCCG GACGGCGGGC CATTCGCGCC TGCCGGTCTA CGGCGACACC
CTGGACGATC CCCGCGGCAT GGTCCACATC CGCGACTTCG TCGACCACCT CGCCACCCGC
GCCGAGGCCG GCGCGGCCCA CGGCGCCAAA TCCCCGGCCA GATCCCCGGC CAACTCCCCG
GCCAACTCCC CGGCCAAGCC CGCGGCCGAG CCGCCGCCGG TGATCCAGGG GGACGGGCGG
GCGCGCCGGC CGCACCTCGC CCGGACGCCC GACCTGTGCG AGGTCGATCT CGACCTCTCC
CTCGCCGCCA CGCGGATCCT GCGGCCCGTG CTCTACGTGC CGCCCTCGAT GCCGGCGATC
GACCTCCTGG TGCGGATGCA GGCCAGCCGG ACCCACATGG CCCTCGTCAT CGACGAGTAT
GGCGGGACCG ACGGGCTGAT CTCGATCGAG GACCTGATCG AGGTCGTGGT CGGCGACATC
GAGGACGAGC ACGACGTGGC CGAGGGGCAC CGGGTGCTGC GGGTCGACGG CGAGGCCGAG
ATCTACGTGG CGGATGCGCG CGCGAGCCTC GACGACGTCG CGGAGGCGAC CGGCTTCGAC
ATCGCCGGGG CGGTGGGCGA ACTCGCCGAG GAGGTCGACA CGATCGGCGG CCTCGTCGTC
ACCATCACCG GGCGGGTGCC GTCCCGGGGC GAGGTCGTGG CGGTTCCGGG CGACTTCGAG
GTCGAGGTGC TGGACGCCGA TCCACGCCGC ATCAAGCGGC TTCGCCTCCA CCACGGCCCG
GCCAAGCTCG CCGCCCCGGA GGAGCCCCTG GCCCTGCCGG CACCCCGCAC GCTCAACGGC
AGCGGCGCCC CGGTCGACGC CGGGGCGTGA
 
Protein sequence
MSNDRSRGAA ATAQPAPDED SPAREPWYDR LLTIFHLKPR EAPRDEITDA LAEAQSGDHA 
FSPVERAMLK NVLSLHRVRV DDVMVPRADI VAVPAEISLG ELLKVFRTAG HSRLPVYGDT
LDDPRGMVHI RDFVDHLATR AEAGAAHGAK SPARSPANSP ANSPAKPAAE PPPVIQGDGR
ARRPHLARTP DLCEVDLDLS LAATRILRPV LYVPPSMPAI DLLVRMQASR THMALVIDEY
GGTDGLISIE DLIEVVVGDI EDEHDVAEGH RVLRVDGEAE IYVADARASL DDVAEATGFD
IAGAVGELAE EVDTIGGLVV TITGRVPSRG EVVAVPGDFE VEVLDADPRR IKRLRLHHGP
AKLAAPEEPL ALPAPRTLNG SGAPVDAGA