Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_4002 |
Symbol | |
ID | 9158184 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 4132463 |
End bp | 4133701 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | domain of unknown function DUF1727 |
Protein accession | YP_003648912 |
Protein GI | 296141669 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGGAC TCCCCCTCCG CGCCCGCACG GCGCTGTTCG CCGCCCGTTC CGCAGCCTGG GCGTCGCAGC GCGCCGGCCG CGGCAAGGGC TCGATGATCG GCGGGCTCGT GGCCCTGAAG ATCGAGCCCA ACCTGATGAC CAGCCTCGCG GCGGGCAAGC GCACGGCCGT GATCACCGGC ACGAACGGCA AGTCGACCAC CACCCGGATG ACGGCGACCG CGCTCGGGAC GCTCGGCGCC GAGGTCGCGA CCCAGGCCGA CGGCGCGAAT ATGGACGCCG GCATCGTCGC CGCGCTGACC GCCCACCTCA CGGCACCGCT CGCGGCCCTG GAGGTGGACG AGCTGCACGT ACCGCACGTC CTCGACGCGG TGCAGGCCGA GACGCTCGTG CTGTTGAACC TCTCGCGCGA TCAGCTCGAC CGGGTGGGTG AGATCAACGC CATCGAACGC AAGATCCGCG CCGCGGTCGA TGCGCACCCC GCCTTGACCG TGGTCGCCAA CTGCGACGAC GTGCTGGTCG CCTCGGTGGC GTACGACGCG AAAGAGGTCG TGTGGGTCGC CGCCGGCGCC GGCTGGACCT CGGACTCCGT CTCCTGTCCG CGCTCGGGTG AGCCGATCGT GCGTGACGGC GAGCATTGGT ATTCGACGGG CACCGACTTC TCGCGACCTG CCCCCGACTG GTGGGTCGAT GACGAGAACA TCTACGGCCC GGATGGTTTC GTCGCTCCGC TCACACTGAA GTTGCCCGGT CGGGCGAACC GCGGCAATGC CGCGCAGGCC GTCGCGGCCG CCGTCGCGAT GGGTGCCGAT CCCGCAGCCG CGGTCGCCGC GGTCGGTACG GTGGGCGAGG TGGCCGGCCG CTACTCGACG GTGACCGTGG GCGAGCACAC CGTGCGGATG CTGTTGGCCA AGAATCCGGC CGGATGGCAG GAGGCGATGT CGATGATCGA CGGCACCGCG GAGGGACTGG TGATCGCGGT GAACGGTCAG GTCCCCGACG GCGAGGACCT TTCCTGGCTT TGGGATGTGC AGTTCGAGCG GTTCGAGAAC GGCAAGGTGG TCGCCTCGGG CGAACGCGGT GCCGATCTGG CGGTACGGCT CACCTACGCC GGGGCCGAGC ACTCGCTGGT GGCCGATCCG GTGGCGGCGA TCGCATCGTG CCCACCCGGA CGAGTGGAGG TGCTGGCGAA CTACACCGCC TTCCGTGATC TGAATACGGC ACTGGAGGGC CGCGCCTGA
|
Protein sequence | MPGLPLRART ALFAARSAAW ASQRAGRGKG SMIGGLVALK IEPNLMTSLA AGKRTAVITG TNGKSTTTRM TATALGTLGA EVATQADGAN MDAGIVAALT AHLTAPLAAL EVDELHVPHV LDAVQAETLV LLNLSRDQLD RVGEINAIER KIRAAVDAHP ALTVVANCDD VLVASVAYDA KEVVWVAAGA GWTSDSVSCP RSGEPIVRDG EHWYSTGTDF SRPAPDWWVD DENIYGPDGF VAPLTLKLPG RANRGNAAQA VAAAVAMGAD PAAAVAAVGT VGEVAGRYST VTVGEHTVRM LLAKNPAGWQ EAMSMIDGTA EGLVIAVNGQ VPDGEDLSWL WDVQFERFEN GKVVASGERG ADLAVRLTYA GAEHSLVADP VAAIASCPPG RVEVLANYTA FRDLNTALEG RA
|
| |