Gene Information |
Locus tag | Mext_2371 |
Symbol | |
ID | 5831585 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2620866 |
End bp | 2621984 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641368170 |
Product | Sel1 domain-containing protein |
Protein accession | YP_001639837 |
Protein GI | 163851794 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
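The length fields in this record can be cross-checked against its coordinates. The sketch below assumes 1-based, inclusive start and end positions and a CDS whose final codon is a stop codon; both assumptions are consistent with the values listed here but are not stated on the page. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed for Mext_2371
length = gene_length_bp(2620866, 2621984)
print(length, protein_length_aa(length))  # 1119 372
```

The result agrees with the listed Gene Length (1119 bp) and Protein Length (372 aa).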
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.312198 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCTTTG AAGCCCTCTT CCCTCTGCGG GGGGAGGGAA GGGCGCGCGG GCGCACGGGG AAAGCCGCAC GCCGAACGGG AACCTTGTCC CTCGCGGTCC TGAGCGCCTG CGTCGTTCTC GCGCTCGATA GCGGCGCTCT GCACGCGGCC CCGAAGCAGC CCGCCCCGAA GGAGCCTGTC GCGCAGACGC CGGCCAAGCC GAACCTGTTC GACCTCAAGC GTGAGCTGCC CTCGCCCTAC TCGGCCAACG TCCAGGGGGC GCAGCCCACG ACGGCGAACC CGAATGCCGA CGCGGCCTAC GGGGCCTATC AGCGCGGTCG CTACGTCACC GCCTTCCGCG AGGCGACCAA GCGGATCGAG GCGAATCCGA AGGACGCCGC GGCGATGACG CTGCTCGGGG AACTCTACAA CCAGGGACTC GGGGTCAAGC CGGATCCCAA ACGCGCCCAC GAATGGTACC GGCTGGCGGC CGTGCAGAAC GATCCCAACG CCATGGCCTC CCTCGGCCTG ATGGCGATGG ACGGGCGCGG ACAGCCCAAG GACGAGAAGG CCGGCCGAAC CTGGCTCGAA CAGGCGGCCC GCAAGGGACA GCCCAGCGCC TGCTACAATC TCGCTCTGAT CCAGCTCGCG AGCGACAAGC CGGCGGATCT GGCCGCGGCC CTGGCCAATT TCCGGGCGGC GGCCGAGGCC GAGATCCCCG CCGCGCAATA CGCGCTGGGC GTGCTCTACC TCCAGGGCAA GGGCGTCTCC AAGGACACGA CCCAGGCCGC GCAATGGTTT CGGCGCGCCG CCGACAATGG CGATCTCGGC GCCGAGGTCG AGTTCGCGAT CCGGCTGTTC AACGGCGACG GCGTTCCCAA GGACGAGACC CGCGCCGCCC GCTACTTCCT GCACGCGGCC CAGCGCGGCA ACGCCATCGC CCAGAACCGG ATCGCCAAGC TCTACGTCGC CGGCCGCGGC GTGCCCAAGA ACCTGGTCGA GGCGGCGGCC TGGAACCTCA CCGCAGCCTC GCAGGGCCGC GCTGATGCCG GCCTCGATCA GGCGACCGCC GGCCTCAACG CGGACGAGCG CAAGCGCGCC GAGGCGCTGG CGGCGGACCG GGTGAGCCTC GCGCCCTAA
|
Protein sequence | MTFEALFPLR GEGRARGRTG KAARRTGTLS LAVLSACVVL ALDSGALHAA PKQPAPKEPV AQTPAKPNLF DLKRELPSPY SANVQGAQPT TANPNADAAY GAYQRGRYVT AFREATKRIE ANPKDAAAMT LLGELYNQGL GVKPDPKRAH EWYRLAAVQN DPNAMASLGL MAMDGRGQPK DEKAGRTWLE QAARKGQPSA CYNLALIQLA SDKPADLAAA LANFRAAAEA EIPAAQYALG VLYLQGKGVS KDTTQAAQWF RRAADNGDLG AEVEFAIRLF NGDGVPKDET RAARYFLHAA QRGNAIAQNR IAKLYVAGRG VPKNLVEAAA WNLTAASQGR ADAGLDQATA GLNADERKRA EALAADRVSL AP
|
| |
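The gene sequence starts with GTG while the protein starts with M. This is expected under translation table 11, where GTG is one of the permitted start codons and an initiating codon is read as methionine. The sketch below is a minimal illustration, not the database's own pipeline: the helper names (`clean`, `translate_cds`, `gc_percent`) are hypothetical, and the codon table is built from the standard amino-acid ordering used for table 11. It is run here only on the first 30 bases of the gene sequence; pasting the full sequence should allow the listed protein and GC content to be checked the same way.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons permitted by translation table 11
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the 10-base block spacing used on the page and upper-case."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as methionine
            continue
        residue = CODON_TABLE.get(codon, "X")  # 'X' for ambiguous bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above
prefix = "GTGACCTTTG AAGCCCTCTT CCCTCTGCGG"
print(translate_cds(prefix))  # MTFEALFPLR
```

The output matches the first ten residues of the listed protein sequence.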