Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_4201 |
Symbol | |
ID | 9158389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 4329528 |
End bp | 4330622 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | protein of unknown function DUF475 |
Protein accession | YP_003649108 |
Protein GI | 296141865 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCACTGC GGACCTTCGG ACTCTCGATC GTCGTGACGA TCGCCGCCCT GATCGTCGCC TTCCTCTACG GCGGCCCCCA GGCCCTACTG CTCACCGCGA TCCTGGGCGT CCTGGAGATC AGCCTGTCGT TCGACAACGC GGTGATCAAC GCGACGGTGC TGCGCCGGAT GAGCGACTTC TGGCAGAAGA TGTTCCTCAC CGTGGGCATC CTGATCGCCG TGGTGGGCAT GCGGCTGTTG TTTCCGCTGG CCATCGTGTG GATCACCGCG GGTCTCGACC CGGTGCACGC CATGCGACTC GCGCTCAACC CGCCCACCGA CGGCTCCCCC ACGTACGAAT CTCTCGTGAC CGCGGCGCAC CCCCAGATCG CGGCCTTCGG CGGCATCTTC CTGCTGATGC TCTTCCTCGA CTTCGTACTC GCGGAGAAGG ACGTCACCTG GCTCAGCTGG ATCGAGAAGC CCCTGGAGAA GATCGGCCGG CTCGACCAGC TGTCCGTGGT GATCGCCTCC GGCGCGCTGC TCTACACCGC GATGTACATC GCTCACCCCG GCGAGGAGAC CACGGTGCTC GTCGCCGGTC TGCTGGGAAT GATGACGTAC CTGGTGGTGA ACGGCCTCGG CGAGCTCTTC CACATCGACG AGGAAGCGGA GATCGCCGAC CTCGACGCCG AGAGCAAGCC GAACAGCGGA CCATCGGAGC TGGCGAAGGC GGCCGGTAAG GCGGGCTTCT TCCTGTTCCT CTACCTCGAG GTGCTCGACG CGTCGTTCTC CTTCGATGGG GTGATCGGCG CGTTCGCGAT CACCGCTGAT CCGATCATCA TCGCCCTGGG CCTCGGCTTG ATCGGCGCGA TGTTCGTCCG GTCCATCACC GTGTACCTGG TGCGCCAGGG CACGCTGTCG CAGTACGTGT ACCTCGAACA CGGGGCGCAC TGGGCGATCG GTGCGCTCGC CGTGATCCTG TTGTACTCGA TCGGCACCCC GGTACCCGAG GTGGTGACCG GCCTGATCGG ACTCGTGCTG ATCATCGCGG CGCTGATCTC CAGCGTGGTC CGCAACCGGC GTGAGGGCGC CACGCCGGTT CCTGTGGATG CCTAA
|
Protein sequence | MALRTFGLSI VVTIAALIVA FLYGGPQALL LTAILGVLEI SLSFDNAVIN ATVLRRMSDF WQKMFLTVGI LIAVVGMRLL FPLAIVWITA GLDPVHAMRL ALNPPTDGSP TYESLVTAAH PQIAAFGGIF LLMLFLDFVL AEKDVTWLSW IEKPLEKIGR LDQLSVVIAS GALLYTAMYI AHPGEETTVL VAGLLGMMTY LVVNGLGELF HIDEEAEIAD LDAESKPNSG PSELAKAAGK AGFFLFLYLE VLDASFSFDG VIGAFAITAD PIIIALGLGL IGAMFVRSIT VYLVRQGTLS QYVYLEHGAH WAIGALAVIL LYSIGTPVPE VVTGLIGLVL IIAALISSVV RNRREGATPV PVDA
|
| |