Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_0078 |
Symbol | |
ID | 9154212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 83839 |
End bp | 85104 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003645071 |
Protein GI | 296137828 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.656506 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGACCA TCATGAAAGC ACGGTTCGCC GCGTTCGCGG CTTCCTTCGT CATCGCAGCC TCATTGCTTT CCGCACCGCC GTCCGCGTCG GCGAACGGCT CCTATGGGTT CGGCCAACGC GGCGTGAACA CCAACATGCT ATTCGTCCAA TCACAGACCG CATACGACTG GCCGGACGAA TCGCCATCGA AATTCGGTGA GCCGCAGAGT ACATACGACT ACCTTGCGGC GAAAGGGCAC AAGTACGTCC GCATCGGGTT CAACTGGGAC GTGATGCAGC GTGGAATCAC TGCCACTAAT ACCGGGGCAC CATTGGACGC GAAATACGTA GCGAAGGTCT CATCGGAGAT CGCCAAGGCG AAGAAGGCCA GCCTGAAGGT AGTACTCGTG CTGGGCAATC CATGCACGTG GGAGGACCGC ACCGGTGCTA CGAATTGGGA GGAACCCACT CCCAGCAGAG TCCTGTTGTG CGGTAAAGGA CTGACCGACG CCCACCTCGC CAACCTCTGG CGGCGAATCT CGACACTCTA CAAGAGCGAG CCGGCCGTCG CCGTGTACGA CCTGGTCAAC GAGCCCGTCT CCTATCAGCA TCCCATCCGG ACAGACATGC AGGATCCGGG GACCTGGCCG GCTCCGTACA GCGCCTACAA GTCCGCGATC AACGCCTCGA TCAAAGCCAT TCGTGACAAC GGCGACGACA AGTATGTCTG GGTACAGTCG TTGTGCTGCA CCCTCAACCA CGACTTCGCC AGCACCGACC CGAACGGGCC CTGGGCCGTC GATCCGCTCA ATCGCATCGA GTATTCGCAG CACATGTATC CCGTCAGCGA CCCCAATACG GCCTCGAAAT TCGAAGAGAA GAAGCTCGAT CCGAACTACG ATTCGGCACC CGGACAATTC TGGTCCGATC GCGGGTACAC CACCGGGTTC CTGTGGCGAC TGGATACTTT CGGTGGTTGG TGCAGCCAGT TCTCCGTGAA GTGCTCGATC GGCGAAGTCG GCTGGCACAA CGAAATATCA GAGCCCGAAA CCGCTGCGCG ATGGAACGAC CTCGGCGACG AGTTCTATAA CAAGGCCAAC TACTACGGCT TCGACGTCCT GTCATTCGGC GTCACGACCG GGCTTCAGGG CGCCCTCAGC ACGCACGGGA CCAACCCCGT CGGACAGCCC CCGAACCTCC AGGGATGGCA ATTCCCGGCG CCCGGCATTA CCCGCAGCTT CTCCCAGGCC ACCGTCATCG AGAAACCGCA GCACCTCTCG AAGTGA
|
Protein sequence | METIMKARFA AFAASFVIAA SLLSAPPSAS ANGSYGFGQR GVNTNMLFVQ SQTAYDWPDE SPSKFGEPQS TYDYLAAKGH KYVRIGFNWD VMQRGITATN TGAPLDAKYV AKVSSEIAKA KKASLKVVLV LGNPCTWEDR TGATNWEEPT PSRVLLCGKG LTDAHLANLW RRISTLYKSE PAVAVYDLVN EPVSYQHPIR TDMQDPGTWP APYSAYKSAI NASIKAIRDN GDDKYVWVQS LCCTLNHDFA STDPNGPWAV DPLNRIEYSQ HMYPVSDPNT ASKFEEKKLD PNYDSAPGQF WSDRGYTTGF LWRLDTFGGW CSQFSVKCSI GEVGWHNEIS EPETAARWND LGDEFYNKAN YYGFDVLSFG VTTGLQGALS THGTNPVGQP PNLQGWQFPA PGITRSFSQA TVIEKPQHLS K
|
| |