Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_0485 |
Symbol | |
ID | 5831294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 529146 |
End bp | 530366 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641366264 |
Product | hypothetical protein |
Protein accession | YP_001637973 |
Protein GI | 163849930 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.626037 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACGA CCCACTCCTC TCCCGCCCAG CCCAGTCGCG CGACCCGGCC CGTGGTCGCC AGCGGCCTCG TCGCACTTGC AATGGCGATG GGGGTCGGCC GTTTCGCCTT TACGCCGCTG ATGCCGCTGA TGATCCGTGA CGGTACGTTG GACGCTACCA CCGGTACGGA ATGGGCGGCA GTCAACTATG TCGGGTATTT CGTGGGTGCC CTGACCGCCT CGTGGTTCAG CGGCAACCCG CGTCGCGGTC TGCTGCTGAG CCTGATCGGT GTCGCTCTCA CGACACTGGC GATGGTGGCA GTTGATGCCG TTCCCACCAC CCTGCTCGGG GTCATGCTGC GCGGGGCAGC TGGCGTGTTC AGCGCCTGGG CGTTGGTGTG CACGAGCAGC TGGTGCCTGG CCGAACTTGC CCGGCGTCGG GCCGGGCAAC TGGGCGCGTG GATCTACACG GGTGTCGGTC TCGGCATCGC GTTAGCCGGT GTGCTGGCTT GGCTTGGCGG ACGCCAGCGG GCGGACTGGC TCTGGCTTGA ACTAGGGCTC ATCGCCAGTG CCGGGGCGGT GCTCGTTTGG ACGCAGTCAC GGGGGCAAAG CACGATCCCG GCCGAGATCG AAGAACGCGA GGCTACAGCA ATCGCTCCGA CGCGAGGAAG CGGGCAGTTG GCTCTGGTGC TCTGCTACGG AATCTTCGGG TTCGGCTACA TCGTGCCGGC CACGTTCCTG CCGGCCATGG CGCGCGAGCT AGCTCCCGAT CCCCTGGTGT TCGGGTTGAC TTGGCCCTTG TTCGGCCTCG CCGCCGCTCT GTCGGTCGCG GCCGTGGCCC ACTGGCTGCC AAGCACATCG CGTCAACGAC TGTGGGCTCT GTCACAGGGC GTCATGGCGC TCGGCACCGC CCTGCCGCTG TTCGTCCAAG CTCTCTGGGC TGTCGCGGCC TCAGCGGTCT TGGTCGGCGG CACGTTCATG GTAGCGACCA TGGCCGGCTT GCAGCTCGCC CGCGAGGCGC AGCCGGACAA TCCGACCCCG CTCCTTGCGC GAATGACCGC TGCCTTCGCC GCCGGACAGA TCGCTGGCCC ATTGCTGGTT CGCGCGCTTG GTTCCGGCCG CTGGGCCGGC TGGGATGCGC TGGGGTGGAC GGGCGCTCTC GCTACGCTGC TGCTAGTGCT GACGGCAATA TGGCTCTGGC GCAGCACCAA ACCTTCCCTC GAAAGCCTGA GGCCCGTCTG A
|
Protein sequence | MSTTHSSPAQ PSRATRPVVA SGLVALAMAM GVGRFAFTPL MPLMIRDGTL DATTGTEWAA VNYVGYFVGA LTASWFSGNP RRGLLLSLIG VALTTLAMVA VDAVPTTLLG VMLRGAAGVF SAWALVCTSS WCLAELARRR AGQLGAWIYT GVGLGIALAG VLAWLGGRQR ADWLWLELGL IASAGAVLVW TQSRGQSTIP AEIEEREATA IAPTRGSGQL ALVLCYGIFG FGYIVPATFL PAMARELAPD PLVFGLTWPL FGLAAALSVA AVAHWLPSTS RQRLWALSQG VMALGTALPL FVQALWAVAA SAVLVGGTFM VATMAGLQLA REAQPDNPTP LLARMTAAFA AGQIAGPLLV RALGSGRWAG WDALGWTGAL ATLLLVLTAI WLWRSTKPSL ESLRPV
|
| |