Gene Information |
Locus tag | TM1040_1055 |
Symbol | |
ID | 4078113 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1134335 |
End bp | 1135489 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638006359 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_613050 |
Protein GI | 99080896 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
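
The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is only an illustration: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon, which the record does not state explicitly. The function name `cds_lengths` and its parameters are hypothetical.

```python
def cds_lengths(start_bp, end_bp, has_stop_codon=True):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    in the record) and that the stop codon is part of the gene span.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon does not encode an amino acid.
    protein_length = codons - 1 if has_stop_codon else codons
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
print(cds_lengths(1134335, 1135489))  # (1155, 384)
```

Under these assumptions the result matches the Gene Length (1155 bp) and Protein Length (384 aa) fields.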
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.921254 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.096534 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGCGCA GTCTGGCCTC GAACATGCTG ACGATCCTGA TCGTCGGCCT GTTTCTCTTC GCGGGGGTGA TCCTGTGGGG CAAGAATGAA TACACCGCCG AGGGCCCCCT GTCCGAAGCG ATCTGTTTTC AGGTGCCCTC GGGGACCAAT ATGGCACGGG TGTCGCGCCG TCTTGAAAGC GACGGCGTGG TCTCGAGCGG CACCATCTTT CGCATCGGGG TGAAATATTC CGACAAGGCG CAGGACCTCA AGGCAGGCAG CTACCTTGTG GAGCCCGGCG CTTCGATGGA AGGGATCGTG GATCAGATCA CCCGGGGCGG GGCCTCCACC TGTGGCACCG AGATCGTCTA TCGCGTTGGC GTGACCCGCG TGCTGGCCGA GGTGCGCGAG CTGGACCCGG CGACCAACGC CTTTGTGGAA CGTGCTGAAT TTGTCCCCGG CGTGGATGAG ACCCCCGCGG TCTATACCGA GAAAAAATCC GAGGCTGACA CGCGCTACCG TATTGCACTG GCAGAAGGCG TGACCAGCTG GCAGGTGGTC GAGTCCCTGA AGGCGATGGA CATTCTCGAG GGGGAGCCGG GCCGCCGTCC GCCGGAGGGC AGCCTTGCGC CCGACAGCTA CGAGGTCCGC CCCGGAACCT CGCGCGAGGC GGTGCTGGCC GAGATGCAGG CGCGGCAAGA CAGGCGGATC GCGGACGCCT GGGAAGCCCG TAGCCCCGAT GCGGCTGTCA AAACCCCAGA GGAAATGCTG ATCCTCGCCT CGATCATCGA GAAGGAAACC GGCGTTGCTG AGGAACGCGG TGTGGTGGCC TCTGTCTTCA CCAACCGCCT GCGGCGCGGC ATGCGCCTTC AGACCGACCC CACGGTGATC TATGGTGTGA CCAAGGGAGA GGGCGTTCTG GGCCGGGGCC TACGGCAGAG CGAGCTGCGC GGCGCAACGC CGTGGAACAC GTATGTGATC GAAGGTCTTC CACCAACACC CATCGCCAAT CCCGGGCTTG AAAGCCTGGT GGCGGCAGTG AACCCGGATC AGACTGACTA TGTGTTCTTT GTGGCCGATG GCACCGGCGG CCATGCTTTT GCCGAAACGC TCGAAGAGCA CAATCGCAAC GTCGCGAAAT GGCGAAAGAT CGAAGCCGAG CGCAACAACA ACTGA
|
Protein sequence | MWRSLASNML TILIVGLFLF AGVILWGKNE YTAEGPLSEA ICFQVPSGTN MARVSRRLES DGVVSSGTIF RIGVKYSDKA QDLKAGSYLV EPGASMEGIV DQITRGGAST CGTEIVYRVG VTRVLAEVRE LDPATNAFVE RAEFVPGVDE TPAVYTEKKS EADTRYRIAL AEGVTSWQVV ESLKAMDILE GEPGRRPPEG SLAPDSYEVR PGTSREAVLA EMQARQDRRI ADAWEARSPD AAVKTPEEML ILASIIEKET GVAEERGVVA SVFTNRLRRG MRLQTDPTVI YGVTKGEGVL GRGLRQSELR GATPWNTYVI EGLPPTPIAN PGLESLVAAV NPDQTDYVFF VADGTGGHAF AETLEEHNRN VAKWRKIEAE RNNN
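
Both sequences are displayed in space-separated blocks of 10 characters, so the spaces need to be removed before the sequences are used elsewhere. A minimal sketch of that step, with a simple GC fraction calculation added for comparison against the GC content field, might look like this. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than the full record.

```python
def join_blocks(grouped):
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two display blocks of the gene sequence, for illustration only.
first_blocks = "ATGTGGCGCA GTCTGGCCTC"
seq = join_blocks(first_blocks)
print(len(seq))          # 20
print(gc_fraction(seq))
```

Applied to the full gene sequence, `len(join_blocks(...))` can be compared with the Gene Length field and `gc_fraction` with the GC content field.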