Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_1896 |
Symbol | |
ID | 9156046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 1981653 |
End bp | 1982942 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | Protein of unknown function DUF2029 |
Protein accession | YP_003646848 |
Protein GI | 296139605 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.73633 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTCCG ACCCTGTCCT CGTCATCAAG TGGATCCTGT GGCCGCTCGC CATCCTGACG CTGCTGCAGC GGCTGATCGT GCTGGTCCCG AATCGAGCCA AGACCGACGA CTTCACCACG GTCTACACCG CGGCCGTGCG GTTCCTGCGG CACGAGCCGG TGTACGTGGA GAACTACGAC ACCGTCGACC CCCACTACCT CTACGCACCG TCCGGCTCGC TACTCATGGC ACCCTTCGGG TACTTCGATC CGGAGACGGC CCGGAACATC TTCGTGGCGC TGAGCACGAT CGCGCTGCTG GTGGCGCTGT ACGTGATGCT GCGGATGTTC GGATACGGCC TCAACTCCGC CGTGGCGCCG GCCGTACTGT TCTTCGCGGT GATGACCGAG GCCGTGACCA ACACCCTGGT CTACACGAAT TTCAACCTGT TCCTGCTGCT CGCCGAGGTC GTGGCGATGG CGCTGCTGCT CAAGCGCAAG GACCTGTGGG CCGGCGTACC GCTCGGACTG TCGTTCGCGG TGAAACCTCT CCTCGCGGTC TTCCTGCTGT TGATCATCCT GAACCGGCAG TGGAAGGCCC TGATCACGGC GATCAGCGTG CCCGTCGTGC TGCTGGGTAT CGGTTGGGCC CTCGCAGCGG ATCCCGAGAG CTACATCGAT CGCACCATGC CGTACCTCGG CCAGGCGCGC GACTATTACA ACAGCTCGAT CAGTGGCAAC GCCGCCTACT TCGGGCTCCC CGATTGGCTC ACGCTGCTGC TCAAGGCCGC CGTGGTGGTC ATGGCACTGA TCTCCGTATA CCTGCTGTAC CGCTACTACC GCACCAGCAA CGAACTGTTG TGGTGGTCCA CCTCGGCCGG CGTGCTGATG ACCGCGATGT GGCTGGTGGG GTCGCTCGGG CAGGGCTACT ACTCGATGCT GCTGTTCCCG ATGCTGATGA CAGTGGTCTT CCGCAGCTCG GTGATGACCA ACTGGCCCGC GTGGCTCGCC GTGTACGGCT TCCTCAGTTA CGACTTCTGG CTCTCTGGGA AATGGCTCGC GCTGGGCCGC AGCCTCGCGT ATCTCAAGTG CACACTGGGC TGGACTCTGC TCATCGTGGT CACCTTCTGC GTGCTCCTCT ACCGCTACCT CGACGCGCGT GCCGAGGGTC GGCTCGACCA GGGCATCGAC CGGCCGCTCC CGCCCGGTGA CGAACCCGGC CAGCATGAGC GGCACGAACC GGCAACCGTT GCGGCGCACA CCGACACCGG TGATGACGGA CCGGTGGCAC CCGGCGTCGC ACGCGGTTAA
|
Protein sequence | MKSDPVLVIK WILWPLAILT LLQRLIVLVP NRAKTDDFTT VYTAAVRFLR HEPVYVENYD TVDPHYLYAP SGSLLMAPFG YFDPETARNI FVALSTIALL VALYVMLRMF GYGLNSAVAP AVLFFAVMTE AVTNTLVYTN FNLFLLLAEV VAMALLLKRK DLWAGVPLGL SFAVKPLLAV FLLLIILNRQ WKALITAISV PVVLLGIGWA LAADPESYID RTMPYLGQAR DYYNSSISGN AAYFGLPDWL TLLLKAAVVV MALISVYLLY RYYRTSNELL WWSTSAGVLM TAMWLVGSLG QGYYSMLLFP MLMTVVFRSS VMTNWPAWLA VYGFLSYDFW LSGKWLALGR SLAYLKCTLG WTLLIVVTFC VLLYRYLDAR AEGRLDQGID RPLPPGDEPG QHERHEPATV AAHTDTGDDG PVAPGVARG
|
| |