Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_4020 |
Symbol | |
ID | 9158202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 4148842 |
End bp | 4150122 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | Protein of unknown function DUF2029 |
Protein accession | YP_003648930 |
Protein GI | 296141687 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.643753 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGAACG AGATCAGCCG GCGGTTCCCG CGCCAGTGGT GGGTGATGGC GCTGTTCGCC GTGGCCGCCG GGATCGCGGT GTGGCAGCAG TTCGTGCGAA TCCCGTTCTC CACGCCGATG TTCGGCCTGT TCCACAACCA GATTGACGCC GAGGTCTACC GCAAGGGTGG CGAGATCGTG GCGCACGGCG GGGCCGGCCT CTACGACGGT CCGCTGCTCA ACGACATCCT GCCCTTCACG TACACGCCCT TCGCGGGCGT GATCTTCGTT CCGCTCTCGT GGATGTCGAC CGAGGTGCTG CGCTGGGTGT GGACGATCGC GATCATGGTC GCGCTGCTGG CCTGCATCCT GATCGCCTTC CGGCTGCTGG GCTTCGCCCG CGACCGCCGG GTCTGGTTCG CCGCGATCTG CCTCACCCTG GTGGCCACGG TGCTCGAACC GGTGCGCACC ACGATCTGGT TCGGGCAGAT CAACGTCTTC CTCATGCTGA TCCTGCTGTG GGACCTCGGA CGCCCGAAGG GGTCGCGCGG CAAGGGATTC AGCATCGGCA TCGCCGCCGG CATCAAGCTC ACGCCGCTGT TCTTCCTCGC CTACCTCGTG GTCACCCGGC AGTGGCGCGA GGCCCGCAAT GCGCTGATCA CCCTCGCCGG CACCGTGGCC ATCGGCTTCG CGGTGATCCC GCAGGAATCG CTGAAGTACT GGTCGGGCAC GTTCCTCGAC GCGAACCGCG TGGGCGAGCC CAACCAGGTG GGCAACCAGT CCATCAACGG CCTGATCGCC TACCTCTCCC ATGCTGACGA ACCGTCGACC GCGCACTGGC TGCTCGCGGC CGTGCCCGCC GCGCTCGCCG GTCTCGCCGT GGCCGCGTGG GCGTACCGGC GCGGTCACGT GCTGCTCGCG ATCACCCTCG TCGGCCTCAC CGCGTGCGCC GTCTCGCCGT TCTCGTGGGG CCACCACTGG GTGTGGTTCG TGCCGCTGCT CATCGTGCTG CTCACCCCGG CGTTGCTCGC GACAGACTGG CCGCACCGCC TGGCCTACCT GGTGGCGCCG CTCGCGCTGG TGGCGGCCAC GTTCATCTGG GTCACCTATT TCCCGAACCT GCAGCCCACC GGGGTCGAGA CCGTGCACGA CACCTATGCC ATCGGTATCT TCTTGCGGCC CTTCGACAAT CCTGTGCTCA AGGCCCTCGC GGGCGGTGTC TACGTCTGGC TGTTCCTGGC CACCCTGATC GGCAGTGCGG CGTATCTCGC CCGGGCGGAC CGGTCGGGTC GTGACGCCTG A
|
Protein sequence | MANEISRRFP RQWWVMALFA VAAGIAVWQQ FVRIPFSTPM FGLFHNQIDA EVYRKGGEIV AHGGAGLYDG PLLNDILPFT YTPFAGVIFV PLSWMSTEVL RWVWTIAIMV ALLACILIAF RLLGFARDRR VWFAAICLTL VATVLEPVRT TIWFGQINVF LMLILLWDLG RPKGSRGKGF SIGIAAGIKL TPLFFLAYLV VTRQWREARN ALITLAGTVA IGFAVIPQES LKYWSGTFLD ANRVGEPNQV GNQSINGLIA YLSHADEPST AHWLLAAVPA ALAGLAVAAW AYRRGHVLLA ITLVGLTACA VSPFSWGHHW VWFVPLLIVL LTPALLATDW PHRLAYLVAP LALVAATFIW VTYFPNLQPT GVETVHDTYA IGIFLRPFDN PVLKALAGGV YVWLFLATLI GSAAYLARAD RSGRDA
|
| |