Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_1233 |
Symbol | |
ID | 9155374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 1265592 |
End bp | 1267553 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | transcription termination factor Rho |
Protein accession | YP_003646203 |
Protein GI | 296138960 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.303183 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCGAAA CGGACGTGAC CACCAGCCCG GACACGGCTG CCTCCACCTC GGCCAGCGCA CAGCGCGCCG ACGAGCGGCG CCGCTCCGGC CTGAACGGCA TGGTGCTGGC CGAACTCCGC ACCATCGCGG GTGACCTGGG GATCAAGGGG ATCTCCGGCC TCCGCAAGGG CGACCTGATC GCCGCCATCA CCGCCCAGCA GGGCTCGGGC GCGGCCCCGA CGTCCAAGGC GGAGAAGCCC GCCAAGACCG CGCCTCCCAA GGCCGCGGCT CCGAAGGCGG ACGCTCCGAA GGCGGAGGCT GCCAAGGTGG AGGCTCCGAA GGCGGAGGCC ACCACGGCGG AGGCTCCGAA GACGGAGGCG CCGAAGAACG ACGCCCCGAA GAACGACGCC CCGAAGAACG ACGCCCCGAA GAACGACGCC GCCCCGGCGG ATGCCCCGAA GGACGACGCC AAGAGCGGCG CGAAGAACGA CGGCGACCAG CCGCAGCAGC AGCGCGAGGG TGGCCGCCGC GAGCGGAACC AGCGCCGCGG CCAGAACGCC GATGGCCAGA ACCAGAACGA CCGGCAGGGC GGTGACCAGA ACCGTCCGGA CCGCGACCAG AACCGGCAGC AGGGCCAGAA GAACAACAAC AACAACGGTC CGAACAACCG GAACCAGAAC AACGGACCGG ACGACGACGG TGACGGGCAG GGCCGTCGCG GGCGGCGGTT CCGCGAGCGT CGTCGTGGAC GTGACCGCAA CCAGAACGAC GGTGAGCCGC AGGTCAGTGA GGACGACGTC CTCCAGCCCG TGGCCGGCAT CCTGGATGTG CTCGACAACT ACGCCTTCGT GCGGACATCG GGCTACATCG CGGGACCCAA CGACGTCTAC GTCTCCATGA ACATGGTCCG CCGCAACGGC CTGCGCCGTG GTGACGCCAT CGTGGGCGCC GTGAAGATGC CGCGCGACGG AGAGAGCAAT GACGGCGGTA ACCAGGGCGG CGGTGGCGGC CGCGGTAACC AGTCCAACCG GCAGAAGTTC AACCCGCTGG TGCGCCTGGA CAGCGTGAAC GGCCAGGACG TCGATTCGGC CAAGAACCGG CCCGAGTTCA ACAAGCTCAC GCCGCTGTAC CCGAACCAGC GCCTGCGCCT GGAAACGGCG CAGAATATCC TCACCACCCG TGTGATCGAC CTGATCATGC CGATCGGCAA GGGGCAGCGT GCGCTGATCT CCGCACCGCC GAAGGCCGGT AAGACCACGA TCATGCAGGA CATCGCCAAC GCGATCGCCA CCAACAACCC CGAGTGCTAC CTGATGGTGG TGCTGGTGGA TGAGCGTCCG GAGGAGGTGA CCGATATGCA GCGCAGCACC AAGGGCGAGG TCATCAGCTC CACCTTCGAC CGCCCACCGT CGGATCACAC CTCGGTCGCC GAGCTCGCGA TCGAGCGGGC CAAGCGGCTG GTGGAGGGCG GCAAGGACGT GGTGGTGCTG CTCGACTCGA TCACCCGTCT CGGCCGTGCC TACAACAACT CGAGCCCCGC TTCCGGTCGG ATCCTGTCCG GTGGTATCGA TTCCACCGCG CTGTACCCGC CCAAGCGGTT CCTCGGCGCA GCCCGCAACA TCGAGAACGG TGGCTCGCTC ACCATCATCG CCACCGCGAT GGTCGAGACC GGTTCGACCG GCGACACCGT GATCTTCGAG GAGTTCAAGG GCACCGGCAA CGCCGAGCTC AAACTGGACC GCAAGATCTC CGAGCGCCGG ATCTTCCCGG CGGTCGACGT GCACCCGTCG AGCACCCGGA AGGACGAGCT GCTGCTCGCT CCCGATGAGG CCGCCATCGT GCACAAGCTG CGCCGTCTGC TGTCGGGCCT GGACAGCCAG CAGGCGATCG AGCTGTTGAT CAGTCAGCTG AAGAAGACGC AGAACAACAT CGAGTTCCTG ATGATGGTGC AGAAGAACAG CGGTTTCGCC GACGCCGAGT AG
|
Protein sequence | MTETDVTTSP DTAASTSASA QRADERRRSG LNGMVLAELR TIAGDLGIKG ISGLRKGDLI AAITAQQGSG AAPTSKAEKP AKTAPPKAAA PKADAPKAEA AKVEAPKAEA TTAEAPKTEA PKNDAPKNDA PKNDAPKNDA APADAPKDDA KSGAKNDGDQ PQQQREGGRR ERNQRRGQNA DGQNQNDRQG GDQNRPDRDQ NRQQGQKNNN NNGPNNRNQN NGPDDDGDGQ GRRGRRFRER RRGRDRNQND GEPQVSEDDV LQPVAGILDV LDNYAFVRTS GYIAGPNDVY VSMNMVRRNG LRRGDAIVGA VKMPRDGESN DGGNQGGGGG RGNQSNRQKF NPLVRLDSVN GQDVDSAKNR PEFNKLTPLY PNQRLRLETA QNILTTRVID LIMPIGKGQR ALISAPPKAG KTTIMQDIAN AIATNNPECY LMVVLVDERP EEVTDMQRST KGEVISSTFD RPPSDHTSVA ELAIERAKRL VEGGKDVVVL LDSITRLGRA YNNSSPASGR ILSGGIDSTA LYPPKRFLGA ARNIENGGSL TIIATAMVET GSTGDTVIFE EFKGTGNAEL KLDRKISERR IFPAVDVHPS STRKDELLLA PDEAAIVHKL RRLLSGLDSQ QAIELLISQL KKTQNNIEFL MMVQKNSGFA DAE
|
| |