Gene Mext_1203 details


Gene Information

Locus tag: Mext_1203
Symbol:
ID: 5832216
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 1329285
End bp: 1330475
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 75%
IMG OID: 641366996
Product: hypothetical protein
Protein accession: YP_001638676
Protein GI: 163850633
COG category:
COG ID:
TIGRFAM ID:
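
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (the values in this record are consistent with both assumptions). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1329285, 1330475)
print(length, protein_length_aa(length))  # 1191 396
```

Both results match the Gene Length and Protein Length fields reported for Mext_1203.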


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.403327
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.0837257
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACATCG CCGAGCTCGT CCAGAGTCGC TCCGCCCGGC CCGTCATCCG CCGTCGGCGC 
CGCCCGCCTG TCGCGGCGCT CGCGCTGACC GCCCTCGGGG GTCTCGCGGT CGCCGGGATG
CTGCAACCGC AGGATGGCGC CGCGCCCGCC CCCGTGGAAA CCGCCCAGGT CTCGTCCGAG
ACCGACGGGG CGGAGACCGT GCAGGCAGCG CCCGGCGCGC TGGCCTGGAT GCTCGACCCC
ACCCCCGTGC TCGACACCGC GTCCCGCGGC TTCGTCCCGC GGACCGCGCA GGTGTCCGCC
TTCCGGGCGC CGCCCGTGGC GGAGCCTTCC AAGTCGCAAC CTGCCCCATC GGAGCCGGTC
ACGACCCTGG CCGCCCTCCC CGCGGAGCAG GCTGCCGACG CCGCGCCCGC CATCACACCG
CGCCCGGCCG CGCCGACGGC TCGCCTCGCC CTCACGGTGC CGCTGCCGGT GCGGCGGCCC
GACGAATTCC GCTACGAGCC GCGCACGCAG ACCGCCCGGG CGGCGACGCC TCGGGTAGCC
ACCGCATCCA CGGCGCGCGC CTCCGCCGAG CCGACCCGCA GCGTGTTCAG CGCCGCGGTC
ACGGACAACC GCAACTTCCT CGAGAAGATG TTCGATTTGC CGGGTTCGAC GCCCACGCCG
TCCTCGGACA AGGCAATGGC CTATGCCGGA CTCGACAGCG GGGCGGTCGA TTCAGCCGCG
CGGCGCCGCG TCGTGCCGGG CCCGGTCTCC GAGCCGGGGG TCGCCGTCTA CGACATCAGC
GCCGCCACCG TGACGCTGCC GAGCGGCGAG GTGCTGGAGG CCCATTCCGG CCTCGGCGTC
GCCCAGGACA ACCCGGACCA CGTCCATGTC CGGATGCGCG GCGCCACCCC GCCCGGAGTC
TACGACCTGC GGGAGCGTGA GGCGCTGTTC CATGGGGTGC GGGCGATCCG CCTCAATCCG
GTCGGCGGCT CGGCCGCCAT CCACGGGCGT GACGGCATCC TCGCCCATAC CTACATGCTG
GGCCCGAGCG GCGCCTCGAA CGGCTGCGTC GTGTTCCGGA ACTACACCCG CTTCCTCCAG
GCCTATCTGA GTGGCGAGGT CCGGCGCCTG GTCGTGGTGC CCGGCACGGC CCCCGGAATC
TTCGCCAGCC GGCGCAACGC CACGCCGCGC CGCTCGGCCT CGGCGGATTA A
 
Protein sequence
MDIAELVQSR SARPVIRRRR RPPVAALALT ALGGLAVAGM LQPQDGAAPA PVETAQVSSE 
TDGAETVQAA PGALAWMLDP TPVLDTASRG FVPRTAQVSA FRAPPVAEPS KSQPAPSEPV
TTLAALPAEQ AADAAPAITP RPAAPTARLA LTVPLPVRRP DEFRYEPRTQ TARAATPRVA
TASTARASAE PTRSVFSAAV TDNRNFLEKM FDLPGSTPTP SSDKAMAYAG LDSGAVDSAA
RRRVVPGPVS EPGVAVYDIS AATVTLPSGE VLEAHSGLGV AQDNPDHVHV RMRGATPPGV
YDLREREALF HGVRAIRLNP VGGSAAIHGR DGILAHTYML GPSGASNGCV VFRNYTRFLQ
AYLSGEVRRL VVVPGTAPGI FASRRNATPR RSASAD
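
The sequences above are printed in blocks of 10 characters, so the whitespace has to be stripped before they can be used. The sketch below shows one way to do that, compute a GC fraction, and translate the nucleotide sequence. It is an illustration under stated assumptions, not the method used to produce this record: the helper names are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares. Table 11 also allows alternative start codons, which this sketch does not handle; the gene here starts with ATG, so that does not matter for this example.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Translation table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence listed above.
first_row = "ATGGACATCG CCGAGCTCGT CCAGAGTCGC TCCGCCCGGC CCGTCATCCG CCGTCGGCGC"
print(translate(first_row))  # MDIAELVQSRSARPVIRRRR
```

The translated residues agree with the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.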