Gene Information |
Locus tag | Tpau_2026 |
Symbol | |
ID | 9156181 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 2110849 |
End bp | 2112198 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | allantoinase |
Protein accession | YP_003646977 |
Protein GI | 296139734 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACGG ACAGCACCGA GTACGACGTG GTCGTCCGCG GCCGCCGGAT CATCACCACA GCGGGTGTCG TCGCGCGAGA GGTCGGTATC CGCGACGGCC GCGTGGTGGC GATGGAACCG CTCGGCAACG CTCTCCTGGG CAGGGAGACG ATCGAGCTCA CCGACGAGCA GGTCATGATT CCTGGGCTCG TGGATACCCA CGTGCACGTG AACGAACCGG GCCGTACGGA ATGGGAGGGC TTCGAATCCG CGACGAAGGC CGCCGCCGCC GGTGGTGTCA CGACGCTGGT CGATATGCCG CTGAACTCGA TCCCGCCCAC CGTCGACCGC GAGGCGCTTC GGCTCAAGCG CGAGGTCGCA GAGCCCAAGG CGTACATCGA CGTCGGATTC TGGGGCGGCG CCGTTCCGGG CAACCGTGAC CAACTGCGCA GCCTGCACGA CGACGGGGTG TTCGGCTTCA AGTGCTTCCT GCTGCACTCC GGCGTGGACG AGTTCCCGCA CCTCACGGCC GACGAGATGG AAGAGGACAT GGCCGAACTG GCCACGTTCG ATTCCCTGAT GATCGTGCAC GCCGAGGATT CCCGGGCGAT CGATCACGCG CCCTCTGCGG AGGGCGCCGA GTACGCGCGC TTCCTCGCTT CCCGTCCACG GGGCGCGGAG AACGTCGCGA TCGCGGAGGT GATCGAGCGC GCCCGCTGGA CCGGCGCCCG CGCCCACATC CTCCATCTCT CATCGTCCGA CGCACTGCCG ATGATCGCGT CGGCTCGCCG GGACGGCGTG AAACTCACCG TCGAGACCTG TCCGCACTAC CTCACGTTGC TCGCCGAGGA GATCCCCAAC GGCGCCACCG CCTTCAAGTG CTGTCCCCCC ATCCGGGAAG CCTCGAACCG CGAACTCCTC TGGCAGGGCC TCAAAGACGG CGTGATCGAC TGCATCGTGT CCGACCACTC CCCTTCGACG CTCGACCTCA AGGACGTGGA GAACGGCGAC TTCGGCGTGG CGTGGGGCGG CGTCGCCTCA CTCCAGTTGG GGCTCTCACT GATCTGGACC GAGGCCAAGC GCCGTGAGAT CCCGCTGGAA CAGGTAGTGG AGTGGATGTC CGCCAAGCCC GCCAAGCTCG CAGGCATGAC CCGCAAGGGC CGCATCGCGC TGGGCTTCGA CGCCGACTTC GCGATCTTCG AACCGGAGGC GGCGCAGGTC GTCGACGTGC ACCGGCTGCA CCACAAGAAC GCCATCTCGC CCTACGACGG CAAGGCCCTC GCCGGTGTCG TGACCCAGAC CTGGCTGCGT GGCGTGCGGA TCGACTTTCA GACTCCGCAC GGCAAGCTGC TGCGCCGCGG CGACGCCTGA
Protein sequence | MSTDSTEYDV VVRGRRIITT AGVVAREVGI RDGRVVAMEP LGNALLGRET IELTDEQVMI PGLVDTHVHV NEPGRTEWEG FESATKAAAA GGVTTLVDMP LNSIPPTVDR EALRLKREVA EPKAYIDVGF WGGAVPGNRD QLRSLHDDGV FGFKCFLLHS GVDEFPHLTA DEMEEDMAEL ATFDSLMIVH AEDSRAIDHA PSAEGAEYAR FLASRPRGAE NVAIAEVIER ARWTGARAHI LHLSSSDALP MIASARRDGV KLTVETCPHY LTLLAEEIPN GATAFKCCPP IREASNRELL WQGLKDGVID CIVSDHSPST LDLKDVENGD FGVAWGGVAS LQLGLSLIWT EAKRREIPLE QVVEWMSAKP AKLAGMTRKG RIALGFDADF AIFEPEAAQV VDVHRLHHKN AISPYDGKAL AGVVTQTWLR GVRIDFQTPH GKLLRRGDA
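
The coordinate and length fields in this record can be cross-checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the gene sequence (TGA here) is a stop codon that is not counted in Protein Length. The function names are hypothetical and do not belong to any database API.

```python
# Values copied from the record above.
START_BP = 2110849
END_BP = 2112198


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Spaces and any other characters (such as the grouping used in the
    record's sequence field) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


length = gene_length(START_BP, END_BP)
print(length)                  # 1350, matching Gene Length
print(protein_length(length))  # 449, matching Protein Length
```

Passing the Gene sequence field to `gc_percent` gives a value that can be compared against the GC content row.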
| |