Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_1123 |
Symbol | |
ID | 9155263 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 1149188 |
End bp | 1150378 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | UBA/THIF-type NAD/FAD binding protein |
Protein accession | YP_003646094 |
Protein GI | 296138851 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.632622 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGTACGT TGCCACCGCT CGTCGAACCC GCCGCCGAAC TGACCCGTGA CGAAGTCGCG CGTTATTCGC GGCACCTGAT CATCCCCGAG ATGGGCGTCG AGGGGCAGAA GCGTCTCAAG AACGCCAAGG TCTTGGTGAT CGGTGCCGGC GGTCTCGGCA GCCCCGCGCT GCTGTACCTC GCCGCGGCGG GTATCGGAAC CATCGGGATC GTCGAGTTCG ACGAGGTGGA CGCGTCCAAC CTGCAGCGCC AGGTGATCCA CGGCGTCTCC GACCTGGGCC GGCCCAAGGC CGAGAGCGCG CGCGACTCCA TCGCCGAGAT CAACCCGCTG GTCACGGTGA ACCTGCATCA GGAACGGCTC GAACCCGAGA ACGCGGTGCA GCTCTTCGAG CAGTACGACC TGATTGTCGA CGGCACCGAC AACTTCGCCA CCCGCTACCT GGTCAACGAC GCAGCGGTGC TGGCGCACAA GCCCTACGTC TGGGGCTCGA TCTTCCGGTT CGAGGGGCAG GCGTCGGTGT TCTGGGAGGA TGCGCCCGAC GGTCCGAACG GTGAGAAGCA GGGCCTGAAC TACCGCGACC TGTACCCGGT GGCGCCACCG CCCGGCATGG TTCCCTCGTG CGCCGAGGGC GGCGTGCTCG GCATCCTGTG CGCCTCGATC GGGGCGATCA TGGGCACCGA GGCGGTCAAG CTGATCACCG GTATCGGCGA CTCGCTGCTC GGCCGGCTCA TGGTCTATGA CGCCCTCGAC ATGACCTACC GCACCATCAA GATCCGCAAG GACCCCGCTT CACCGGCGAT CACCGAGCTG ATCGATTACG AGGAGTTCTG CGGGGTGGTC TCGGACGAGG CTCAGGCCGC TGCCGCGGGG AGCACCATCA CCCCGGCCGA GTTGGTCGAG CTGGGCGAGC AGGGCGTGGA GTACGAGCTG ATCGACGTGC GTGAGCCTGT CGAATGGGAC ATCGTGCATA TCGACGGTGC GAAGCTGGTA CCGAAGTCGG TCTTCGAGAC CGGTGAGGGC CTGGAGCGGG TCTCGGCCGA CAAAAAGCTG GTGCTCTACT GCAAGACCGG TATCCGCTCC GCCGAGGTGC TCGCAGCGGT GCAGGGCGCC GGGTACCGCG ACGCGGTACA TCTGCAGGGC GGAATCAACG CCTATGCCCG GCAGGTGGAT CCGTCACTAC CGGTGTACTG A
|
Protein sequence | MSTLPPLVEP AAELTRDEVA RYSRHLIIPE MGVEGQKRLK NAKVLVIGAG GLGSPALLYL AAAGIGTIGI VEFDEVDASN LQRQVIHGVS DLGRPKAESA RDSIAEINPL VTVNLHQERL EPENAVQLFE QYDLIVDGTD NFATRYLVND AAVLAHKPYV WGSIFRFEGQ ASVFWEDAPD GPNGEKQGLN YRDLYPVAPP PGMVPSCAEG GVLGILCASI GAIMGTEAVK LITGIGDSLL GRLMVYDALD MTYRTIKIRK DPASPAITEL IDYEEFCGVV SDEAQAAAAG STITPAELVE LGEQGVEYEL IDVREPVEWD IVHIDGAKLV PKSVFETGEG LERVSADKKL VLYCKTGIRS AEVLAAVQGA GYRDAVHLQG GINAYARQVD PSLPVY
|
| |