Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_0299 |
Symbol | |
ID | 5832738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 334917 |
End bp | 335951 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641366084 |
Product | protein of unknown function DUF900 hydrolase family protein |
Protein accession | YP_001637794 |
Protein GI | 163849751 |
COG category | [S] Function unknown |
COG ID | [COG4782] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.168302 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCCCT CCGACGCCAC CCGCCGCTCT CTCCTGCGCC TCATGGGGTT GGCTGGCGTG GCCGGACTCT CCCCTGCCCT GTCGGGTTGC GTGATGGACG GGCTCGCCAC CGGCGCCGTC GGCGAGACGC AGTCGGCGCT CGACGTGAGC CCGGTGCTCC TCATCGCCAC CACCCGTCGC CCCGCCGCCG GCAATCCGCC CAAAGCGCCG TTCTTCGGCT CGGAGCGCGG CCGGGGCCTG AGCTTTGCCG AGGCGCGCAT GACCGCGCCG GACCGCTCGC TGATCGGCAA GGTCTCGGCG GTGGTGGGCG GCGATTGGGG CGTCCGCTCC GTCGGCGACG TCACGACGGG GTCGGGCGCG GCGGCGGCCT TCGCCCAATC CGCCTTCGGC CGCGATGTGC TGATCTACGT CCACGGCTAC CGCGAGAGCT TCGAATCGGC CGCAATCAGC GCCGCCCGCC TCTCCGACGG TATCCGCTTC AACGGCGCTT CCGCCCTTTT CACATGGCCC TCGGCCGCGG CGACCTTCGA TTACGGCTAC GACCGCGAGA GCGCACTGTG GTCCCGCGAC GCGTTCGAGG ACCTGTTGAA GACCGTGGCG ACCACGCCGA GCGGCGGGCG CATCCACATC GTCGCCCACT CGATGGGCAC GCTCCTCACG TTGGAGACGC TGCGCATGCT GCGGGCCGAG GCCGGCGAGG CGGCGGTCGC CCGGATCGGC GCCGTGGTGC TTGCCGCGCC CGACATCGAC ATCGACCTGT TCACCAACGG CGTCGAGCGC CTGGGGCCGG ACGCCAAGCG CATCACCGTC ATCTCGGCGA CGAACGACCG CGCACTCGAA TTGTCGGGCG CCATTGCCGG CGGCGTCGTC CGCGCGGGCG CCGCCGACCG GGAGCGCCTG GAGGCTCTGG GCGTGCGCGT GGCCGATGCC TCGGATTACG GCGGCGGCCT CTTCAACCAC GATCTGTTCC TGTCGAACCG CGAGGTTCAG GCCGTCGTCA AGCGGGCCGT CTCGCGGGGC AGCAGCGGCA CCTGA
|
Protein sequence | MQPSDATRRS LLRLMGLAGV AGLSPALSGC VMDGLATGAV GETQSALDVS PVLLIATTRR PAAGNPPKAP FFGSERGRGL SFAEARMTAP DRSLIGKVSA VVGGDWGVRS VGDVTTGSGA AAAFAQSAFG RDVLIYVHGY RESFESAAIS AARLSDGIRF NGASALFTWP SAAATFDYGY DRESALWSRD AFEDLLKTVA TTPSGGRIHI VAHSMGTLLT LETLRMLRAE AGEAAVARIG AVVLAAPDID IDLFTNGVER LGPDAKRITV ISATNDRALE LSGAIAGGVV RAGAADRERL EALGVRVADA SDYGGGLFNH DLFLSNREVQ AVVKRAVSRG SSGT
|
| |