Gene M446_1072 details

Gene Information

Locus tag: M446_1072
Symbol:
ID: 6131520
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 1191534
End bp: 1192805
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 71%
IMG OID: 641641364
Product: polysaccharide export protein
Protein accession: YP_001768036
Protein GI: 170739381
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1596] Periplasmic protein involved in polysaccharide export
TIGRFAM ID:
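
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the gene information in this record.
start_bp = 1191534
end_bp = 1192805
gene_length_bp = 1272
protein_length_aa = 423

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes one terminal stop codon,
# which is not counted in the protein length.
codons = gene_length_bp // 3
coding_codons = codons - 1

print(span_bp == gene_length_bp)           # True
print(gene_length_bp % 3 == 0)             # True
print(coding_codons == protein_length_aa)  # True
```

All three checks hold for this record: the span is 1272 bp, which is 424 codons, or 423 amino acids plus a stop codon.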


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.660733
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.636279
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGAACC AGGCCCCGCG CCGCCCGTCC CGCACGAGGG CGGGCCTCGT CCTGGGGGCG 
GTCCTCGCCC TCGCGGCGCA TCCCGCCGCG GCGGCCTACC TGATCGCGCC GGGCGACACG
CTCACGATCG AGGCGGTCGC GGTGCCCGAA CTGAAGGCCA AGAGCGTCGT CAACGGCGAC
GGCGAGGTCA CGGTGCCGCT GGTCGGCCAG GTCCCGGTCG CCGGGCTGAG CCTCGCCGAG
GCGCGGGCGA AGATCCAGTC GCTCCTGCCG TCGAAGGAGA TCCGCCGCCG CACCGACGAC
GGCCGGGAAT TTCCGCTGAT CCTCTCGGCC TCCGAGATCA ACGTCGCGAT CCTGGAATAT
CGACCGGTCT ACCTGAACGG CGACGTGGCC AAGCCGGGCG AGCAGCCCTA CCGGCCGGGC
ATGACCGTGC GCCAGACCGT GGCGCTGGCC GGGGGCTTCG ACATCCTCCG CTTCAAGATG
GACAACCCGT TCCTGCAGCT GTCGGACCTG CGCGACGCCT ACAACACGGC CTGGATCGAC
TACGCCAAGG AGCAGCAGCG GCTCTCGCGC CTGCGGGCCG AGCTCGACGG CAAGGGGGAG
CTCGACCGCA AGGCGGTGAT CGAGACGCCC GTCGCGCCCT CCGTGGCGAA GGAGCTCTCG
GAGGGCGAGC GCGCGATCCT GACGACGCGC AACGAGGACA TCGTCAAGGA GAAGCGCTAC
CTGACCGAGG CCGCCGCCAA GGAGAACGAG CGGGCCTCGG TGCTCAGCGA GCAGGAGCGG
CGCGAGAAGG AAGGCGTCCA GTCGGACACT GACGATCTCA AGCGCTACCA GGAATTGTTC
GATCGCGGCA ACGTGCCGAT GCCGCGCCTC GTCGAGGCGC GCCGCACGGT CCTGCTCTCG
GCCACCCGCG CCCTCCAGAC CATGGCGGTC CTCGCCTCGG TCGAGCGCGA GAAGGGCGAT
CTCGGCCGCA AGCTCCAGCG GGTCGACGAC ACCCGGCGCC TCGAGGTGCT GCGCGAGATC
CAGGACTCGA CCGCCAAGCT GGCGGCGATC CGCAGCCGGC TCCAGGCGGT CAGCGAGAAG
CTGCGCTACA CCGGAATGGT CAAGTCCCAG CTGGTGCGCG GCTTCGACAG CCAGCCGCAG
ATCTCGATCT TCCGCAGGTC CGGCGGCAAG ACGGCCCGGA TCCCGGCGGA TCACGACACC
GAGCTGCAGC CCGGCGACGT GGTCGAGGTC GCCTTGCAGG CCGAGGAGCC GCCCGAGGTG
CCGACCCGCT GA
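
The gene sequence is displayed in blocks of ten bases, so the whitespace has to be stripped before any computation. As a rough Python sketch, the hypothetical helpers `clean_sequence` and `gc_percent` below show one way to compute a GC percentage for comparison with the 71% reported for this gene; how the database rounds its value is not stated, so an exact match is not guaranteed.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence, pasted with its spaces.
first_line = "ATGGTGAACC AGGCCCCGCG CCGCCCGTCC CGCACGAGGG CGGGCCTCGT CCTGGGGGCG"
seq = clean_sequence(first_line)

print(len(seq))         # 60 (six blocks of ten bases)
print(gc_percent(seq))  # GC percentage of this first line only
```

Pasting the full sequence into `clean_sequence` in place of `first_line` gives the value for the whole gene.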
 
Protein sequence
MVNQAPRRPS RTRAGLVLGA VLALAAHPAA AAYLIAPGDT LTIEAVAVPE LKAKSVVNGD 
GEVTVPLVGQ VPVAGLSLAE ARAKIQSLLP SKEIRRRTDD GREFPLILSA SEINVAILEY
RPVYLNGDVA KPGEQPYRPG MTVRQTVALA GGFDILRFKM DNPFLQLSDL RDAYNTAWID
YAKEQQRLSR LRAELDGKGE LDRKAVIETP VAPSVAKELS EGERAILTTR NEDIVKEKRY
LTEAAAKENE RASVLSEQER REKEGVQSDT DDLKRYQELF DRGNVPMPRL VEARRTVLLS
ATRALQTMAV LASVEREKGD LGRKLQRVDD TRRLEVLREI QDSTAKLAAI RSRLQAVSEK
LRYTGMVKSQ LVRGFDSQPQ ISIFRRSGGK TARIPADHDT ELQPGDVVEV ALQAEEPPEV
PTR
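
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The following is a minimal Python sketch of that step, assuming the codon-to-amino-acid assignments of table 11 are the same as in the standard genetic code (the sketch ignores the table's rules for alternative start codons). `translate` is a hypothetical helper written for illustration, not part of any database tooling.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...);
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Fragments copied from the start and end of the gene sequence.
print(translate("ATGGTGAACC AGGCC"))  # MVNQA
print(translate("CCGACCCGCT GA"))     # PTR
```

The two fragments reproduce the first five and last three residues of the protein sequence above. This gene starts with ATG, so the sketch needs no special handling of the first codon; a gene starting with an alternative initiation codon such as GTG would need its first residue set to M.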