Gene M446_1202 details


Gene Information

Locus tag: M446_1202
Symbol:
ID: 6133886
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 1322985
End bp: 1323989
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 72%
IMG OID: 641641490
Product: hypothetical protein
Protein accession: YP_001768161
Protein GI: 170739506
COG category: [R] General function prediction only
COG ID: [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain)
TIGRFAM ID:
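
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in this record.
length = gene_length_bp(1322985, 1323989)
print(length, protein_length_aa(length))  # 1005 334
```

Under these assumptions the values agree with the Gene Length (1005 bp) and Protein Length (334 aa) fields.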


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGGT CGATCGAGCG GCGCGGTGAG GTAGGTGACG GGAAGCCGTC GAACGGAGCG 
CGTGCCTACG TCGGCCTCGA CGCCCTCCTG CGCCTTCGGC ATCGGGCCAA GGGCTTCAGC
TTCCTGCCGC GGCAGCCCGT GCACAGCCTG CTCTCGGGCC GGCACGCCTC CCGCCTGCGC
GGACGGGGCC TGAACTTCGA GGAGCTGCGC CACTACTTCG AGGGCGACGA CACCCGCACC
GTCGACTGGC TGGCGACCGC GCGCCTCGGA GCGCCGCACG TCCGCGTCTA CTCGGAGGAG
CGCGACCGGC CGGTCCTGCT CCTCGTGGAC CAGCGCACCA CGATGTTCTT CGGGAGCCGG
CGGGCAATGA AGTCGGTCGT CGCGGCCGAA GTCGCGGCGC TCGCGGCCTG GCGGGTCACA
TCGCTCGGCG ACCGGGTCGG TGCGGTCGTG TTCGGCGACG TCGACATGAC CGAGGTGAGA
CCGCAGGGGC GAGAGACGGG GGCCGTCCGG GTCATCGCCG AGGTCGCGCG TCGGAACGGA
CTTCTCGGTA CCGGCGCCCC TTCTCCCGGA GCCGACGGGC AGCTCAACGA GGCCCTGCGC
CGGGCCGAGC GCCTGGCGAC GCACGACTGG CTCGTCTGCC TCGTCACCGA CGCCGCGGGC
GAGGACGCCG AGACCGCGCG CCTCGTCACG CGCCTCACGG CCCACAACGA TGTCCTGACG
GTGCATGTCC ACGACCCGCT CGAACTCGAG CTCCCTGACG TGGGGCCCGC GGTGTTCAGC
TCGGGCGAGG CACAGGTCGA GGTCGATTCC TCCTCGAAGA GCCTGCGCAG CCGCTACTCG
GACGATCGCG CCGAGTGGCG GGAGCGCCTG GGCGCGCTCT CGCTCCGGCG AGCCATCCCC
GTGCTCCCGG TCACGACCGC CCAGGATGTC GCGACCCAGT TGCAGGCGCT GATCGGGAGA
AGGGCCGAGC GTCGTCTCAC CAGCGTCGGA GGGACAGCCC CATGA
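
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before any computation. The following is a minimal sketch of that cleanup together with a GC-percentage calculation. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, used here only as sample input.
first_line = "ATGGCGCGGT CGATCGAGCG GCGCGGTGAG GTAGGTGACG GGAAGCCGTC GAACGGAGCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 1005 bp sequence through the same two functions gives a figure that can be compared with the GC content field of the record.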
 
Protein sequence
MARSIERRGE VGDGKPSNGA RAYVGLDALL RLRHRAKGFS FLPRQPVHSL LSGRHASRLR 
GRGLNFEELR HYFEGDDTRT VDWLATARLG APHVRVYSEE RDRPVLLLVD QRTTMFFGSR
RAMKSVVAAE VAALAAWRVT SLGDRVGAVV FGDVDMTEVR PQGRETGAVR VIAEVARRNG
LLGTGAPSPG ADGQLNEALR RAERLATHDW LVCLVTDAAG EDAETARLVT RLTAHNDVLT
VHVHDPLELE LPDVGPAVFS SGEAQVEVDS SSKSLRSRYS DDRAEWRERL GALSLRRAIP
VLPVTTAQDV ATQLQALIGR RAERRLTSVG GTAP
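
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below builds the codon table from the standard NCBI ordering (bases in the order T, C, A, G). It assumes that translation table 11 assigns the same amino acids to codons as the standard table and differs only in which start codons are permitted, which does not matter here because the gene starts with ATG. The function name is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGCGCGGT CGATCGAGCG GCGCGGTGAG"))  # MARSIERRGE
```

The output matches the first ten residues of the protein sequence listed in this record. The final codons of the gene (GGG ACA GCC CCA TGA) translate to GTAP followed by a stop, which matches the end of the protein.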