Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2584 |
Symbol | |
ID | 4077495 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 2721455 |
End bp | 2722450 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638007908 |
Product | aminotransferase |
Protein accession | YP_614578 |
Protein GI | 99082424 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01140] L-threonine-O-3-phosphate decarboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000040763 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAGAAC GCTCTTCCTT TGCGGCAACG CAAGACCCGG CGCAGGCCTC GGTCTCCGGT TCCCCCCGCG ATCATGGCGG CAATCTCGCC GCCGCGATCG CACAGTATGG CGGCAGCCGC GAAGACTGGA TTGATCTCTC CACAGGGATC AACCCTGCAC CCTACCCTCT GCCGCCATTG CTGGCCGAAG ACTGGACCGC TCTGCCCGAT CAAGATGCCC AGCAGGCTCT GACGGATGCA GCGCGCCGGT TCTGGCAGGT ACCCGACAGC GCAGAGATCC TTGCCGCCCC CGGCGCTTCG GCGCTGATTG CACGGCTCCC TGTCCTGCGC CCGGCACGCA GCGTGCAAAT CACGACGCCC ACCTACAACG AACACGCCGC TGCATTTGAC AATTGTGGCT GGCAGGTGCT TGAGGTGGGT CCAACGGACG CGCAAGTTGT GGTTCACCCC AACAATCCCG ACGGCCGCCT GTGGCTGCCC GAGGAACTCA AGGCACCTTT CTGCATCATT GACGAGAGTT TCTGTGACAT CTGCCCGGAG CGGAGTCTGA TCAACCGCGC AGCACAGCCG GGCACCGTGG TCCTCAAGAG CTTTGGCAAG TTCTGGGGGC TCGCGGGGCT GCGGCTCGGG TTTGCCATCG GCGATCCGGA GCTCATCGCC GGACTGCGCG CAGCCATGGG GCCGTGGGCG GTGTCCGGCC CGGCGCTGCG CGTGGGAACG CACGCTCTGA ACGATCAGAA CTGGGCCGAG GCCAGCCGCC TTGAGCTTGC GGATGGAGCC TCGCGCCTTG ACCAGCTGAT GCAGCACGCC GGCGCACAGA CCGTAGGCGG CACCGATCTC TTCCGGCTTT ATGACGTTGA AGATGCCCGC GCCTGGCAGG AGCGCCTCGC GCGTGCGCAC ATCTGGAGCC GCATCTTTCC CTATTCAACC CGTTTCCTGC GCCTTGGCCT GCCTCCCGCA GACCGCTGGA GCCAACTGGA GACCGCCCTC ACATGA
|
Protein sequence | MAERSSFAAT QDPAQASVSG SPRDHGGNLA AAIAQYGGSR EDWIDLSTGI NPAPYPLPPL LAEDWTALPD QDAQQALTDA ARRFWQVPDS AEILAAPGAS ALIARLPVLR PARSVQITTP TYNEHAAAFD NCGWQVLEVG PTDAQVVVHP NNPDGRLWLP EELKAPFCII DESFCDICPE RSLINRAAQP GTVVLKSFGK FWGLAGLRLG FAIGDPELIA GLRAAMGPWA VSGPALRVGT HALNDQNWAE ASRLELADGA SRLDQLMQHA GAQTVGGTDL FRLYDVEDAR AWQERLARAH IWSRIFPYST RFLRLGLPPA DRWSQLETAL T
|
| |