Gene Information |
Locus tag | Tpau_0207 |
Symbol | |
ID | 9154341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 220459 |
End bp | 221529 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | protein of unknown function DUF6 transmembrane |
Protein accession | YP_003645200 |
Protein GI | 296137957 |
COG category | |
COG ID | |
TIGRFAM ID | |
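The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Tpau_0207 record above.
length = gene_length(220459, 221529)
print(length, protein_length(length))  # prints: 1071 356
```

Under those assumptions both derived values agree with the Gene Length and Protein Length fields of the record.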
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.855752 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAGAATC GCCTTCCTGC CCTCGCCCTG GTGATCGTCA TGGTCGCCTG GGCATCCGCC TTCGTCGTCA TCCGCGGGAT CGGGCCGCAT GTCTCGCCCG TTCCCCTGGC CGAAGGACGC CTCCTGGTAG GTGCCGCCGT CCTCGGCGTC ATCTGGGCGC TGCACGCCCT GTACACCGGC GACGTCCGGC TCCCCCGTGG TCGCGCGCTC GCCCTCACCG TGGCCTACGG CGCGCTGTGG TTCGCGCTGT ACACCGTGCT GGTCAACGCG GCCGAACAGC ACCTCGACGC CGGCACCACC GCACTGCTGG TGAACATCGC ACCGATCATC GTCGCGGTCC TGGCCGGCGC GTTCCTGCAC GAGGGCTTCC CGCCGCTGCT CACCGCAGGA ATCGTGATCA GCTTCGCCGG CGCGGCCGTG ATCGCCTTCT CCGGCGACGG CCGGCGCGAC GGCGCCGGGG TGGCCCTCGC CGTCGCCGCT GCGCTGCTCT ACGGGATCAG CGTGGTGGCG CAGAAACTGG TAACGCGCTC GACGGATCCG CTCACGGCCA CCGCACTGGG CGCGATCATC GGCGCGATCG TGCTGCTCCC GTGGGCGCCC CAGTTGTTCC GGGAGCTGGC GGCCGCGCCG ATCGGCTCCA CCGCCGGCGT GCTCTACCTG GGGCTGGTGC CCACTGCCCT GGCCTTCCTC CTCTGGGCAT ACGCACTGGC GCACACTCCG GCCGGGGTCA CGGCATCGTC GTCCTACGCG GTGCCGGCCC TGTCGATCCT GTTGAGCTGG GGCTTCCTCG CCGAAACGCC CACCGCCTGG GGCCTGCTGG GCGGGGTGCT GTGCCTGGTC GGCGTCGCGG TGGCGCGGCT GCCGCGACGG TCAGCGCGTC CGGGCGACGT CGTCGCGCAG GACGGTGGGC CACAGGGCGA CGTCCGCGGG GGTGCGGGCG ACGGTCAGCG TGGCGCGCGG GAGGAGCTCG GCGAGGGTCT CGGCGGTCGA GACGGGGTGC GCCGGATCGT CGATCCAGGC GAGCAGCGTG ACCGGGACCT CGATCGCGGC GAGCGCCTCG CGTTCCGGTA G
Protein sequence | MKNRLPALAL VIVMVAWASA FVVIRGIGPH VSPVPLAEGR LLVGAAVLGV IWALHALYTG DVRLPRGRAL ALTVAYGALW FALYTVLVNA AEQHLDAGTT ALLVNIAPII VAVLAGAFLH EGFPPLLTAG IVISFAGAAV IAFSGDGRRD GAGVALAVAA ALLYGISVVA QKLVTRSTDP LTATALGAII GAIVLLPWAP QLFRELAAAP IGSTAGVLYL GLVPTALAFL LWAYALAHTP AGVTASSSYA VPALSILLSW GFLAETPTAW GLLGGVLCLV GVAVARLPRR SARPGDVVAQ DGGPQGDVRG GAGDGQRGAR EELGEGLGGR DGVRRIVDPG EQRDRDLDRG ERLAFR