Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_4030 |
Symbol | |
ID | 9158214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 4158326 |
End bp | 4159948 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | PE-PPE domain protein |
Protein accession | YP_003648940 |
Protein GI | 296141697 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.276755 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGAGT TCGTATCGAC GTCCCGTTCC CGCCGTCTTG CCACCGGGTT GATCGCCCTG GTGAGCACGA TCGCACTCGC CCTGGGCACC GCCGCCGTCG GTGCCGCGGT CTGGGCGACG ACGGTGCTGG TCGTGCCCGG AACGGGAACG AAGAACCCCG CCGTGGTGCC GGGCTATCAG GAGAACGCCG TCGGCTATTA CGTGGCACCG ACGGGGGCCT GCGCGGACGG TTGCACCGCC GCCCCGGTGC CGTACATCGC CGAGTTCTGG CCGATCCCGC TCAAGGGGTG GGGCGGGCTC GAGGGCGCCA AGTGGGACGA TTCCGTTGCC AGCGGGGTCA CCAGCCTCAA CACCGTGTAC GCGGCCACCG ACAAGACCGA GCCGATCGTG ATCTTCGGCT ACTCGCAGGG CGGGACGGTG GTCAGCGATG TGAAGCGGCA GTTCGCCGCG CAGCCGGGCG GCGTACCCGA GAACGTCTCG TTCGTCCTGA TCGGCAATGC GAGCCGGCCC AATGGAGGCC TGTTCTCACG GCCGGCGTTC CTCGGCCATA TCCCGATCCT CGACGTCACC TTCAAGCCGG GCACGCCCAC CGACACCTCG ACACAGGTCA ACACGACCGA CGTCGCGTTC CAATACGACG GTGTGACCGA CTTCCCGCGC TACCCCGTGA ATCTGCTCGC CACCGCGAAT GCGCTCGCCG GTTTCTGGTT GGTGCACGGC AAATACCTCA CGCCCAAGGG CGACGATCCG GCAACGGCGA CCCCCATCGG GTACACCCCT GATGAGGTAC GCACCGCAGT CGCCCGGGCC ACCGAGAACT GCACCGCCGC GAACTACTGC CAGGTCAGCG GCGACACCCG ATACGTGACG CTGCCAGCGA AGATCCTGCC GATCATGCTG CCGCTGCTCG ACCTGGGCGC CGCCACCGGA ACCACCGCTG TGGTGCGCCC GCTCGTCGAC CTGATCTCCC CGGTTACCCG GGTTCTCATC GAAACCGGTT ACACGCGTGA TGATTACGGC CGCCAGACGC CGTTCGGTGT GGTGCCGCTG CTGAATCCGT TCACGCTGAC CGCCGATCTC GCCAGCGGAG TGGTGGAGGG TGTGCGCGCC GCCGCCACCG GCACCGGTGA CGCCTCGCGC TACCTCGCGC CGAAGCCCGC GCCGGCGACC GAATCCACGG GTACCCCGAG CGCCGCGGCT TCCTCGACCG CTGCACCGCA ACCGGTCTCG CTCACCCTCG CGCAGACGCC GTCGAGCACC GCCGCGGCGG GATCGAGCGC AGCGGGATCG AGCGCAGCGG GATCGAGCGC GGCGGGATCG AGCGCGGCGG CGACCGGTGC CGAGTCCACG AGCGCAGCGA CGAGCAGCAC GGCATCGTCA CAACCGACCG ACACCGCCAA GGCCACCGGC ACCGCGACCA CGGAAGCCAC ACCGACCGGT TCCGCCGCGG GCGCGGCCTC GGGCACCGCG AAGGACACCT CGCCGGGCGC CTCGGAGGAG ACCTCTTCGT CGGGCACCGC GAAGGCGGGC ACGTCGAAGG AGTCCACCGG CGCCCCGGCC ACGGACGCGG CATCGACCGG CTCCTCGGCG AGCGAGCCGG TCGGGGTGGC CGCGGCGGCC TAG
|
Protein sequence | MPEFVSTSRS RRLATGLIAL VSTIALALGT AAVGAAVWAT TVLVVPGTGT KNPAVVPGYQ ENAVGYYVAP TGACADGCTA APVPYIAEFW PIPLKGWGGL EGAKWDDSVA SGVTSLNTVY AATDKTEPIV IFGYSQGGTV VSDVKRQFAA QPGGVPENVS FVLIGNASRP NGGLFSRPAF LGHIPILDVT FKPGTPTDTS TQVNTTDVAF QYDGVTDFPR YPVNLLATAN ALAGFWLVHG KYLTPKGDDP ATATPIGYTP DEVRTAVARA TENCTAANYC QVSGDTRYVT LPAKILPIML PLLDLGAATG TTAVVRPLVD LISPVTRVLI ETGYTRDDYG RQTPFGVVPL LNPFTLTADL ASGVVEGVRA AATGTGDASR YLAPKPAPAT ESTGTPSAAA SSTAAPQPVS LTLAQTPSST AAAGSSAAGS SAAGSSAAGS SAAATGAEST SAATSSTASS QPTDTAKATG TATTEATPTG SAAGAASGTA KDTSPGASEE TSSSGTAKAG TSKESTGAPA TDAASTGSSA SEPVGVAAAA
|
| |