Gene M446_1987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_1987 
Symbol 
ID6132928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp2218445 
End bp2219962 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content78% 
IMG OID641642218 
ProductSel1 domain-containing protein 
Protein accessionYP_001768886 
Protein GI170740231 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.461505 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGGCT CGTCCCTCGC CCTCAAGCAC GCGCTCCTGT GGAGCGGCGA TCCCTGGTAC 
CGCGGGGCCT GGATCATCTG GCCGCAGGCC GCGGCCCTGC TCGCCGCCGG CTGGCTCGTC
CTCGGCGGCG CGCTCCCCGT CCCCGAGCCG GCCGCCCCCT GGGCGCGCCC GAAGGCGCCG
CTCACCGGCC CGGAGAGCAC GGCCCTGCGC GACCGCGCCC TCGGCGACCC GGCCGCGCTC
GCGCAGCTGC GGGCGGCGGC GGCGGAGGGC GGCGCGGAGG CGCAGTTCAG CCTCGGCACC
CTGTTCGATC CGACCCTCTC GCTGCGGCGC GCCACCACGG CGCCGGACAT GCGCCAAGCG
CTGGCGCATT ACCGGGCCGC GGCCGAGCAG GGCCACGCGG CGGCGCAGTT CAACCTCGGC
AACGCGCTCT ACTGGGGTAT CGGCGGCGTG CCGGCCGACC CGGCCGCCTC CCTGCCCTGG
ATCGAGAAGG CGGCGCAGCA GGGCGTCGTG CCGGCCCAGC GCCTCGCCGG CCTCGCGGCG
CAGCGCGGCG TCGGCATGGC GGCCGATCCG GCCCGCGCCG CGTCGTGGTT CCGCCGGGCC
GCGGAGGCGG GGGACGCCTT CGCGCAGGCC GAGCTCGGCT GGGCCTACGA GCGGGGGCTC
GGCGGGCTCC CCGCCGATCA GGCCGCGGCG GTGGGCTGGT ACCAGAAGGC CGCCGCGCAG
GGGAATGCCG GGGCCGAGCG GCTGCTCGGG GTGCACCTGC TGGAGGGGCG CGGCATCGCC
GCCAACAAGG CCCAGGCGAT GGAGCACCTC GCCCGGGCGG CGGGACGGGG CGACGCGGAG
GCCCAGGCCC GGCTCGGCTA CGCCTTCCTC ACCGGCGACG GCAAGCCGAT GGACCCGAAG
GAGGCGGTGT CCTGGTTCCA GAAGGCGGCG GACCAGGGCA ACACCTTCGC GCAGCGGCGC
ATGGGCCTCG CCTACCGCGA CGGGTCCGGC GTGCCGGCCG ACCGCGGCCT GTCCCTGCAA
TGGTTCCGCC GCGCCGCCGA GGCGGGCGAC GGGTTCGCGG AGGCCGAACT CGGCGCGGCC
TACGAGACCG GCACCGGCCT GCCCCGCGAC CCGGGCCAGG CCCTCGCCCT CTATCGCCGC
GCCGCGGAGC ACGGCGACCC GCTGGGGCAG GCCAGGACCG GCGAGGCGCT GCTCCTGGGC
ACGGGCGGGC CCCGCGATCC GGCCGCGGCC CTGCCCCTGC TCCAGCGCGC CGCGCAGCAG
AACCAGCCGC TCGCGCAGTA CTATCTCGGC ACGATGTACG ACCAGGGCAA CGGCGTGGCG
GCCAACCCGG CCGAGGCGGT CTCCTGGTAC CAGCGCGCGG CGCGCAACGG CAACGCCGCC
GCCCAGAACG CCCTCGGCGT GGCCTACGCG CGCGGCGCGG GCGTGCCGAG GGACCTCGCC
CAGGCGCGGG CCTGGTTCAG CCAAGCCAAG GCCAACGGCA ACCTCGCGGC CGCCAAGAAC
CTGGAGCAGC TGCGATAG
 
Protein sequence
MAGSSLALKH ALLWSGDPWY RGAWIIWPQA AALLAAGWLV LGGALPVPEP AAPWARPKAP 
LTGPESTALR DRALGDPAAL AQLRAAAAEG GAEAQFSLGT LFDPTLSLRR ATTAPDMRQA
LAHYRAAAEQ GHAAAQFNLG NALYWGIGGV PADPAASLPW IEKAAQQGVV PAQRLAGLAA
QRGVGMAADP ARAASWFRRA AEAGDAFAQA ELGWAYERGL GGLPADQAAA VGWYQKAAAQ
GNAGAERLLG VHLLEGRGIA ANKAQAMEHL ARAAGRGDAE AQARLGYAFL TGDGKPMDPK
EAVSWFQKAA DQGNTFAQRR MGLAYRDGSG VPADRGLSLQ WFRRAAEAGD GFAEAELGAA
YETGTGLPRD PGQALALYRR AAEHGDPLGQ ARTGEALLLG TGGPRDPAAA LPLLQRAAQQ
NQPLAQYYLG TMYDQGNGVA ANPAEAVSWY QRAARNGNAA AQNALGVAYA RGAGVPRDLA
QARAWFSQAK ANGNLAAAKN LEQLR