Gene Mext_3462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3462 
Symbol 
ID5832431 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp3842384 
End bp3843550 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content70% 
IMG OID641369260 
Productpolysaccharide export protein 
Protein accessionYP_001640918 
Protein GI163852875 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1596] Periplasmic protein involved in polysaccharide export 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.775015 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.722378 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGCTC TAGCCATTGC TCTCCTCACC GCTACGGCGG TGTCGGGGTG CTCCATCCTG 
CCGGCCGCTG GGCCGACCAC CTCGGCGATC GAGGGCGGCG CCGACGTCGC CACCGCCGAA
GGTCTGTTTG CCCGCTACGA GATCATCGAC ATCACCCCCG CCCTCGTCGA GGCCCTGCGC
ACCCGTCCCC TCGACAGCCT CCTCGTCACC TTCGGTGACA GCCGCCCGGC GCTCGAACCG
GTGATCGGCG TCGGCGACTA CGTGTCGGTG CAGGTCTGGG AAGCCGGTTC CGGCGGCCTG
TTCTCCGGGC CTCTCGTCTC CGACCGCTTC TCGGCCGGCT CGAAATCCGC GATGATCCCC
GAGCAGGTGG TCGGGCCCGA CGGCGGCATC ACCGTCCCCT ATGCCGGCCG CATCAAGGTC
GTCGGACGCC GCACTCAGGA CGTCCAGGCG CTGATCGAGA CCGAACTCGC CGGCAAGGCA
ATCCAGCCGC AGGTGCTCGT CTCCGTCACC AAGCCGGTCT CGCAATCGGC CACCGTCACC
GGCGAAGGCG CCATGGGCAT GCGCGTGCCG CTGTCGGGCC GCGGCGACCG CCTGCTCGAC
GTCATCGCTC AGGCCGGCGG CGTGCGCACC CCGGTGGCCG AGACCTTCGT GCGGCTCTCG
CGCGGCAACC GCACCGTCAC CGTGCCGATG ACCACGGTGG TCGCCAATCC GCGCGAGAAC
ATCTTCGTGC GGCCGAACGA TACGCTGACC CTGGTGCGCG ATCCGCAGAC TTTCCTGGCC
GTGGGTGCGC TCGGCAACAC CACCGAGGTG CCGTTCACGG CGGACGGGCT GACGTTGTCG
CAGGCGCTGG CACGCGCCTC CGGCCTGCGC GAGTTCCAGG CCGATCCGGC GGGCGTGTTC
ATCTTCCGCT ATGAGCCGGC GGCGGTGGTG CGGCGGCTGC GGCCGAACTC GCCGCTGCTG
GCCTCAGGGC AGGTGCCGGT GGTGTACCGA GTGAACCTGC GCGACGCGCA AGGCATGTTC
CTGACCCAGA GCTTCCGCAT GCGGAACCGC GACCTCGTCT ACGTGTCGAG TTCGCCATTC
GCGGAGCTGG GCAAGGTGCT GGGCGTGTTC TCGACCGTGG CCTCGCCCAT CGCAGCGGGC
GCCTCGATCT ACACTGTCAC GCGGTGA
 
Protein sequence
MRALAIALLT ATAVSGCSIL PAAGPTTSAI EGGADVATAE GLFARYEIID ITPALVEALR 
TRPLDSLLVT FGDSRPALEP VIGVGDYVSV QVWEAGSGGL FSGPLVSDRF SAGSKSAMIP
EQVVGPDGGI TVPYAGRIKV VGRRTQDVQA LIETELAGKA IQPQVLVSVT KPVSQSATVT
GEGAMGMRVP LSGRGDRLLD VIAQAGGVRT PVAETFVRLS RGNRTVTVPM TTVVANPREN
IFVRPNDTLT LVRDPQTFLA VGALGNTTEV PFTADGLTLS QALARASGLR EFQADPAGVF
IFRYEPAAVV RRLRPNSPLL ASGQVPVVYR VNLRDAQGMF LTQSFRMRNR DLVYVSSSPF
AELGKVLGVF STVASPIAAG ASIYTVTR