Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3120 |
Symbol | |
ID | 9157291 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 3229152 |
End bp | 3230570 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | fumarate lyase |
Protein accession | YP_003648046 |
Protein GI | 296140803 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000941956 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAAGA ACGATGGTTC GCCGCAGGGC GAGTTCCGGA TCGAGCACGA CACGATGGGC GAGGTGCGGG TCCCCGCAGC CGCTCTCTGG CGCGCGCAGA CCCAGCGCGC CGTGGAGAAC TTCCCGATCA GCTTCCGTCC GCTCGAACGC CCCCAGATCC GCGCCATGGG GCTGCTCAAA GCCGCATGCG CTCAGGTCAA CAAGGATCTC GGTTTGCTCG CGCCGGACAA GGCCGACGCC ATCATCGCGG CCGCCCAGGA GATCGCCGAC GGCCGCCACG ACGACCAGTT CCCGATCGAT GTCTTCCAGA CCGGTTCCGG CACCAGTTCC AATATGAACA CCAACGAGGT GATCGCCTCG ATCGCGGCGG GGCACGGCGT CACGGTGCAT CCGAACGACG ATGTGAACAT GTCGCAGAGC AGCAACGACA CCTTTCCCAC CGCAACGCAC GTCGCTGCGA CCGAGGCTGC GGTCACGGGC TTGATCCCCG CGTTGGAGCA GCTGCACGCC GCGCTCGCCG AGAAGGCCGA GCAGTGGCGC ACCGTGGTGA AGTCGGGCCG GACCCACTTG ATGGATGCCG TCCCGGTCAC GCTCGGCCAG GAGTTCGGCG GCTACGCCCG GCAGGTCGAG GCCGGCATCG AGCGGGTACG CGCCTGCCTG CCCCGCGTCG GTGAGGTCCC GATCGGCGGC ACCGCCGTGG GCACCGGGCT CAACGCCCCC GACCGGTTCG GTTCGCTGGT GGTGGCCGAA CTGGTCCGCC TGACCGGGGT GCAGGACATC CGGCTGGCGA AGGACAATTT CGAAGCACAG GCCGCCCGCG ACGGACTGGT GGAGCTCTCG GGTGCGCTGC GCACCGTCGC GATCTCGCTG ACCAAGATCG CCAACGACGT GCGATGGATG GGTTCGGGGC CGCTCACCGG TCTCGGCGAG ATCGCACTCC CCGATCTGCA GCCCGGAAGC TCGATCATGC CCGGGAAGGT CAATCCGGTT CTGCCCGAAG CGGTCACGCA GGTGGCGGCG CAGGTGATCG GCAATGATGC CGCGGTCGCC TGGGGCGGGG GTAACGGCGC CTTCGAACTC AATGTGTACA TCCCGATGAT GGCGCGCAAC GTGCTGGAAT CGGTGAAGCT GCTCACCAAC GTCTCGGTCC TGTTCGCGGA CAAGTGCATC GCCGGGCTCG AGGCGCACGA GGAACGGCTG CGCACGCTGG CCGAATCGTC GCCGTCGATC GTGACGCCAC TGAATTCGGC GATCGGCTAC GAAGAAGCGG CGGCCGTGGC CAAGCAGGCA CTCAAGGACG GCACGACGAT CCGGCAGGCG GTGATCGACC GCGGGTTGAT CGGCGATGCG CTCTCCGTCG ACGAACTCGA TAAGCGGCTG GACGTGCTGA AGATGGCGAA CCTGGACCGG AAGCGCTGA
|
Protein sequence | MAKNDGSPQG EFRIEHDTMG EVRVPAAALW RAQTQRAVEN FPISFRPLER PQIRAMGLLK AACAQVNKDL GLLAPDKADA IIAAAQEIAD GRHDDQFPID VFQTGSGTSS NMNTNEVIAS IAAGHGVTVH PNDDVNMSQS SNDTFPTATH VAATEAAVTG LIPALEQLHA ALAEKAEQWR TVVKSGRTHL MDAVPVTLGQ EFGGYARQVE AGIERVRACL PRVGEVPIGG TAVGTGLNAP DRFGSLVVAE LVRLTGVQDI RLAKDNFEAQ AARDGLVELS GALRTVAISL TKIANDVRWM GSGPLTGLGE IALPDLQPGS SIMPGKVNPV LPEAVTQVAA QVIGNDAAVA WGGGNGAFEL NVYIPMMARN VLESVKLLTN VSVLFADKCI AGLEAHEERL RTLAESSPSI VTPLNSAIGY EEAAAVAKQA LKDGTTIRQA VIDRGLIGDA LSVDELDKRL DVLKMANLDR KR
|
| |