Gene Mext_1201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1201 
Symbol 
ID5831507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1325138 
End bp1326334 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content71% 
IMG OID641366994 
Producthypothetical protein 
Protein accessionYP_001638674 
Protein GI163850631 
COG category[S] Function unknown 
COG ID[COG3503] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.592526 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.119748 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGGCTC GCCAGGAGGC GGCCACGGCT GAGTCGGCAC CGGCTCCCAT TGCCGTGGGG 
GACGCGCCAC GCGCCCGCAT CGGCGCCATC GATGCCCTGC GCGGGCTCGT GATGCTGCTG
ATGCTGGTCG ATCACGCGCG GGAGTTCTTC TACCTGCACG CCCAGGTGAG CGATCCGGTG
AACCTCGCGG CGACCCCGCC CGGCCTGTTC CTGACGCGGG CCGCCTCGCA TTTTTGCGCC
CCGGTCTTCC TGCTGCTGAC CGGGCTCTCG GCGAGCCTCT ACGGGCAGAA GCACGGCAGC
CGCGCCGCGA CCTCCGCCTT CCTCATCAAG CGCGGACTCT TCCTGGTCGC CCTCGAAGTG
ACCCTGGTGA ACCTCGCCTG GACCCGTGCC CTGCTGCCGC CGATCCTCTA CCTGCAGGTC
ATCTGGGCGA TCGGTCTCAG CATGATGGCG CTCGCTGCCC TGCTCTGGCT GCCGCGTCCG
GCGCTGATCG GCGTAGGCCT GGCGCTCATG CTCGGGCACA ACGCGCTCGA CGGGATCGTG
CTCGCGCCCG ATCAGCCGGG CTACGCCCTG TGGGCGGTGC TGCATCAGCG CGGCCTGATC
CCGCTGCCCT GGGGCGCGGC GCGCACCTCC TATCCGGTGC TGCCCTGGAT CGGTGTGATC
GCCGCGGGCT ACGCGCTGGG GCCGCTCTAC GGCGCGGGCG TCGATCCGGC CGCGCGCCGC
CGAGGCCTCG TCGCCCTCGG GCTCGCCAGC CTCGTCGCCT TCCTGATCCT GCGCGGCCTC
AACGGCTACG GTGATCCGCA TCCGTGGCAG GCGGGCAAGG ATTGGGGCGC GGACGCCCTG
TCCTTCTTCA ACCTCACCAA ATATCCGCCC TCGGCCGACT TCCTGCTCGC GACCCTCGGT
CCCGGCCTGC TGCTGCTCGC GCTGTTCGAG CGCCTGCCGG AACGGCATCT AGCCTGGCTC
ACCGTCTTTG GCGGGGCGCC GCTGTTCTTC TACCTGCTGC ACTTATGGGC CCTGCGCCTG
GCCTATGACG GCCTTGCCAC CTTCGGCCTT GCGGGTCCCT CCGGCCGGAT CGAAGTCGGC
GCCCCGTATC AGATCTGGCT GATCGCGGCG CTGTTCGCGC TGGCCCTCTA CCCGGCCTGC
CTCTGGATGG TTCGGCTGAA GCGGCGCAGC CGGTGGCGGG GTCTGAGCTA CCTCTGA
 
Protein sequence
MRARQEAATA ESAPAPIAVG DAPRARIGAI DALRGLVMLL MLVDHAREFF YLHAQVSDPV 
NLAATPPGLF LTRAASHFCA PVFLLLTGLS ASLYGQKHGS RAATSAFLIK RGLFLVALEV
TLVNLAWTRA LLPPILYLQV IWAIGLSMMA LAALLWLPRP ALIGVGLALM LGHNALDGIV
LAPDQPGYAL WAVLHQRGLI PLPWGAARTS YPVLPWIGVI AAGYALGPLY GAGVDPAARR
RGLVALGLAS LVAFLILRGL NGYGDPHPWQ AGKDWGADAL SFFNLTKYPP SADFLLATLG
PGLLLLALFE RLPERHLAWL TVFGGAPLFF YLLHLWALRL AYDGLATFGL AGPSGRIEVG
APYQIWLIAA LFALALYPAC LWMVRLKRRS RWRGLSYL