Gene Mext_2371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2371 
Symbol 
ID5831585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2620866 
End bp2621984 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content71% 
IMG OID641368170 
ProductSel1 domain-containing protein 
Protein accessionYP_001639837 
Protein GI163851794 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.312198 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCTTTG AAGCCCTCTT CCCTCTGCGG GGGGAGGGAA GGGCGCGCGG GCGCACGGGG 
AAAGCCGCAC GCCGAACGGG AACCTTGTCC CTCGCGGTCC TGAGCGCCTG CGTCGTTCTC
GCGCTCGATA GCGGCGCTCT GCACGCGGCC CCGAAGCAGC CCGCCCCGAA GGAGCCTGTC
GCGCAGACGC CGGCCAAGCC GAACCTGTTC GACCTCAAGC GTGAGCTGCC CTCGCCCTAC
TCGGCCAACG TCCAGGGGGC GCAGCCCACG ACGGCGAACC CGAATGCCGA CGCGGCCTAC
GGGGCCTATC AGCGCGGTCG CTACGTCACC GCCTTCCGCG AGGCGACCAA GCGGATCGAG
GCGAATCCGA AGGACGCCGC GGCGATGACG CTGCTCGGGG AACTCTACAA CCAGGGACTC
GGGGTCAAGC CGGATCCCAA ACGCGCCCAC GAATGGTACC GGCTGGCGGC CGTGCAGAAC
GATCCCAACG CCATGGCCTC CCTCGGCCTG ATGGCGATGG ACGGGCGCGG ACAGCCCAAG
GACGAGAAGG CCGGCCGAAC CTGGCTCGAA CAGGCGGCCC GCAAGGGACA GCCCAGCGCC
TGCTACAATC TCGCTCTGAT CCAGCTCGCG AGCGACAAGC CGGCGGATCT GGCCGCGGCC
CTGGCCAATT TCCGGGCGGC GGCCGAGGCC GAGATCCCCG CCGCGCAATA CGCGCTGGGC
GTGCTCTACC TCCAGGGCAA GGGCGTCTCC AAGGACACGA CCCAGGCCGC GCAATGGTTT
CGGCGCGCCG CCGACAATGG CGATCTCGGC GCCGAGGTCG AGTTCGCGAT CCGGCTGTTC
AACGGCGACG GCGTTCCCAA GGACGAGACC CGCGCCGCCC GCTACTTCCT GCACGCGGCC
CAGCGCGGCA ACGCCATCGC CCAGAACCGG ATCGCCAAGC TCTACGTCGC CGGCCGCGGC
GTGCCCAAGA ACCTGGTCGA GGCGGCGGCC TGGAACCTCA CCGCAGCCTC GCAGGGCCGC
GCTGATGCCG GCCTCGATCA GGCGACCGCC GGCCTCAACG CGGACGAGCG CAAGCGCGCC
GAGGCGCTGG CGGCGGACCG GGTGAGCCTC GCGCCCTAA
 
Protein sequence
MTFEALFPLR GEGRARGRTG KAARRTGTLS LAVLSACVVL ALDSGALHAA PKQPAPKEPV 
AQTPAKPNLF DLKRELPSPY SANVQGAQPT TANPNADAAY GAYQRGRYVT AFREATKRIE
ANPKDAAAMT LLGELYNQGL GVKPDPKRAH EWYRLAAVQN DPNAMASLGL MAMDGRGQPK
DEKAGRTWLE QAARKGQPSA CYNLALIQLA SDKPADLAAA LANFRAAAEA EIPAAQYALG
VLYLQGKGVS KDTTQAAQWF RRAADNGDLG AEVEFAIRLF NGDGVPKDET RAARYFLHAA
QRGNAIAQNR IAKLYVAGRG VPKNLVEAAA WNLTAASQGR ADAGLDQATA GLNADERKRA
EALAADRVSL AP