Gene Information |
Locus tag | Mext_0917 |
Symbol | |
ID | 5832781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 994389 |
End bp | 995378 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641366699 |
Product | extensin family protein |
Protein accession | YP_001638393 |
Protein GI | 163850350 |
COG category | [S] Function unknown |
COG ID | [COG3921] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
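The coordinate and length fields above are consistent with each other, and a short check can confirm this. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Mext_0917 record.
length = gene_length(994389, 995378)
print(length)                  # 990
print(protein_length(length))  # 329
```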
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGCGTA AAGCGTTAGC GTTCTCGGCT CTGGTGCTGT TCGGCGCGGG GCTCACGGGC TGTGCGATCA ACCGGTTCGA GCGCCGGGAA GCATGGCGTG ACCAAGCCGA ACAGATGTGC ATCGCGCGCA AGCTCGTGCA GCCGACGGCC TATGTCTCGC TCGCCAAGGA GATCGATGGC CCCGGCCCCT GCGGCATGCA GCAGCCGTTC AAGGTCACCC GGCTCGGCGG CGGCACGGTG GCGCTCAAGC AGCGCATGAC CCTGGCCTGC CCGGCGCTCG CCGAGGCCGA GGCGTGGCTC GCCGACACGA TCCAACCCGC CGCCAACCTC TATTTCGGCG TGCCGGTGGC CGAGATCAAC GCGGGCACCT ATTCCTGCCG CGGCCGCAAC AACCAAGCCG GCGCCAAGCT CTCCGAGCAT TCGTTCGGCA ACGCGCTCGA CATCATGTCC TTCACGCTCG CCGACGGACA CGTCATCACC GTCAAGGGCG GCTGGCGCGG CACCGAGGCC GAACAGGCCT TCCTGCGCGA GGTGTTCGTG GGCGCATGCG CCCGGTTCTC GACCGTGCTG GCGCCGGGTT CCAACGTGTT CCACTACGAC CACATCCATG TCGATCTGGC GATGCACGAC CCGCGCGGCC TGAAGCGGAT CTGCAAGCCG CTGCTCAAGT TCGAGTCGCA GCTCAACCTC GCCGACGGCT CGCCGCGGCC GCTGGCCTCG CCGCGCCCGC CTGCGCGCCA GACCGTCCCG ACCCAGGCCC CGATCGACGT CGAAGAGGAC GATCCCTACG GCGTCGCACC GACCTCCTCG CGCACGACCG GCACGCGCGT CGCCCGCGCG CCGGCCGCCC CGGCGCCGAC GGCCTATGCC GCCGCTCCGG CCCCGAGCCG GCCCCGCTCC CCGGTTCCGG CGCATGACGC GGCCTACGCG CCGCTCTCGC TGGCCGCGCC CCATGCCTCG GACCACGCTT CGGACGAGCC GATCTATTAA
Protein sequence | MWRKALAFSA LVLFGAGLTG CAINRFERRE AWRDQAEQMC IARKLVQPTA YVSLAKEIDG PGPCGMQQPF KVTRLGGGTV ALKQRMTLAC PALAEAEAWL ADTIQPAANL YFGVPVAEIN AGTYSCRGRN NQAGAKLSEH SFGNALDIMS FTLADGHVIT VKGGWRGTEA EQAFLREVFV GACARFSTVL APGSNVFHYD HIHVDLAMHD PRGLKRICKP LLKFESQLNL ADGSPRPLAS PRPPARQTVP TQAPIDVEED DPYGVAPTSS RTTGTRVARA PAAPAPTAYA AAPAPSRPRS PVPAHDAAYA PLSLAAPHAS DHASDEPIY
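The two sequence fields can be cross-checked against each other and against the GC content field. The sketch below is a minimal, standard-library-only example: it strips the spaces between the 10-character blocks, computes GC fraction, and translates up to the first stop codon. It uses the standard codon assignments, which translation table 11 shares; it does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene starts with ATG. All names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard codon assignments,
# which translation table 11 also uses for elongation).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGTGGCGTA AAGCGTTAGC GTTCTCGGCT"))  # MWRKALAFSA
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field once its spaces are removed, and `gc_content` on the same input can be compared with the reported 71%.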