Gene Mext_1000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1000 
Symbol 
ID5835759 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1077493 
End bp1078971 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content72% 
IMG OID641366782 
Producthypothetical protein 
Protein accessionYP_001638476 
Protein GI163850433 
COG category[S] Function unknown 
COG ID[COG5373] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGACT ATTATCCGTT GCTGGCACGC GCGCTCGACG CCCTGCCCGA CCGTTCCCCG 
GCCCTGCGCC GGGCCGTCTA CGACCGTGCA CGCAGCGCGC TGATCGCGCA GCTCCGCTCG
CTCGACCCGC CGGTGCCGGA AGCGGACATC GACCTCGAGC GCAAAGCGCT CGACACGGCG
ATCGGCCGCC TGGAGGCGGA ATACGAGGCG CCGCCCGCGG CCGTGACGAC GCCGGCCGAG
GAGCCTGCGG CCGCCGCTCC AGAGGCACCC CCTCCCCCGC CGCCCGAACC GACCCGGCCT
GAGCCGCTCT CGCCCGGCCC CCTGCCGCCG ACGCTGCCGG AGCCGGAGCC GCCGCAGACG
TCCGACGAGC CGCTGGTTCT GCCGCCGGCC TCGGTGCCGG CTGGAATCGG ATCGGACACC
GGACCGGCCG AGCCGAAGCC GCCGACGGAG ACGGTGCCGT TCATGCCGCC GACCCGCCGG
CCGAAAGCCG ACGAGGCCGT GAAGCCTGAG CCGGAAAACG AGAACGGCTT CATCCCACCG
GTTGCCGAGC CTGAGCCGGT CTCCGTCGCG TCCGAGGCCG AGGCCGGCGC CGATCCGGCC
TCACCCGAGA CGAACGGAGC CGGCGAGGCG GGCAACGGCC GCCAGCGCCC GCGCATCGAC
GTGGTGACGC CGCCCGAGGG GCGTTCGCGC CTGCTGCGCA ACCTGTTTGT CGGCGGCGTG
CTCGCGGCGG TGATCGCGCT GATCGCGGTG GCGGCTTTCT TCCTGCGTGA CCAGCCCTCC
GATCTCCAGC AGAGCGCGGC CGAGCAGGAG ACGCCGGCCG AGCAGCCGGA CGCGAAGTTC
TCGGATCGGG TCGGAGCCGA GCGCAACGAG GCCGAGGCCC GGCCGAAGCC GGCCGCTCCC
GGCGCCGCCC CGGCCCAGCC GGAGGTGACC GTCTCGCAGC GGGCGATCCT CTATGAGGAG
AACCAGAGCG ACACGCGCGC CCAGCCGATC GCGACCAACG GCCATACGGT CTGGCGGCTG
GAAGCGGTGA ACGGCGAACA GGGCGAGCCG TTGCAGACGG CACTCCGCGT CAACGTGGAG
TTCCCGGAGG CGGGGCTGAC GCTGGCGATG ACCATGCGCA AGAATCTGGA TGCGACGCTG
CCCGCGTCTC ACACCGTCGA ACTCGCTTTC ACCAACAACG CGGATGCCGG CGCGCAGCGC
GCGGTGCAGA ATATCGGCCT GCTTCAGCTC AAGGACGAGG AAGCCTCCCG CGGCTCCCCG
GTCTCGGGCC TGCCGGTGCG GGTGCGCGAG AACCTGTTCC TGATCGGTCT GTCGTCGCTG
AAGAGCGACG TGGACCGCAA CACCGAGCTG CTGCTGCACA AGAACTGGTT CGATCTGGCC
CTGACCTACG CGAACGGCCA GCGGGCGGTC ATCAGCTTCG AAAAGGGCAG CGCCGGCGCC
CAGGCTCTGC AGAGCGCCTT CGCGCAGTGG CGCGACTAA
 
Protein sequence
MADYYPLLAR ALDALPDRSP ALRRAVYDRA RSALIAQLRS LDPPVPEADI DLERKALDTA 
IGRLEAEYEA PPAAVTTPAE EPAAAAPEAP PPPPPEPTRP EPLSPGPLPP TLPEPEPPQT
SDEPLVLPPA SVPAGIGSDT GPAEPKPPTE TVPFMPPTRR PKADEAVKPE PENENGFIPP
VAEPEPVSVA SEAEAGADPA SPETNGAGEA GNGRQRPRID VVTPPEGRSR LLRNLFVGGV
LAAVIALIAV AAFFLRDQPS DLQQSAAEQE TPAEQPDAKF SDRVGAERNE AEARPKPAAP
GAAPAQPEVT VSQRAILYEE NQSDTRAQPI ATNGHTVWRL EAVNGEQGEP LQTALRVNVE
FPEAGLTLAM TMRKNLDATL PASHTVELAF TNNADAGAQR AVQNIGLLQL KDEEASRGSP
VSGLPVRVRE NLFLIGLSSL KSDVDRNTEL LLHKNWFDLA LTYANGQRAV ISFEKGSAGA
QALQSAFAQW RD