Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_4127 |
Symbol | |
ID | 9158315 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 4252792 |
End bp | 4254390 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | protein of unknown function DUF222 |
Protein accession | YP_003649035 |
Protein GI | 296141792 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAGA CGGTGTGCAC GTCAGCGCAG TATTTGGATG CGCTCGCCGC CTTCGAACAC GCCACTGAAA TCCTGGCCTC CACGGATCCG GTGCTGCTGT CCTCCCCCGA AGTGTTGGAG GCGTTGTCGC GGCTGGAAGT TGCGGCGCGG AAGGTGCCGG TGGCGCAGCA CGCGCTGGTG CAGGTCGCGC ACGAGCAGGG CCTGCCCGGC AGCCTCGGCT ACACCGGCCT CAAGGAAATG TTGGTGGATC GGTTACGGCT CGCCGGCGCC GAAGCCCGCG ACCGGGTTGC CGGTGCGCGG GGCCGGGCGA TCGAGCACCT GCCCTGTGGT GTTGCGCAGC CGCGGTTCGC GTTGACGTTG GCGGCGCAGC GGCAGGGGGC GATCAGTGAA CGGCATGCGT TGGTGATCGA GCAGGTGTTC CGGTCCTGCC CACCCAGTGT GGATCAGCCC GACGTGTTGG AAGACATTCT GGTGACGGCG GCGCGGGAGG TCACCCCCGA AGACGTAGCC AAGGTCGGCC AGCGGGCGAT CGAGTTGCTG GACCCCGATG GAGTGGAGCC GAACCCGGAG AAAGCACGCC GCGCCCGCGG ACTGGAAATC GGGCGCCAGG ATAAGAATCT GATCTCGGAC TTCGGCGGCA CGATGTCCGC CGAATGCCGG GCGTTGATGG AAGCGGTATT CGCGAAATAC GCCCGCCCCG GGGTGCACAA CCCGGACGAT CCCGACAGCC CGGCTGAGGG CGCCACCGCC GACCAGTTGC ACGAGGCGGC GGAGCGGGAT CACCGCAGCC TCGCCCAACG CCAACACGAC GCGTTGACGC ACGCCCTGCG GCTCGCGGTC TCGTCGGGTG AACTCGGCCA GCACCGCGGA CTGTCATGCG TTCCGATCAT CACCATGGGC ATCGATCAGC TCGAATCCGA GGTCGGGATC GCGACCACCG CCACCGGCGG CAGGCTGTCC ATCACAGACG CCATCCGCAT GGCGGGCACC AACCCGAAAT ACGTCCTCCT CCTCGACCTC CACCAACGAC CCCTGTTCCT CGGTCGAGAG AAGCGTCTCG CCACCGCGGA TCAGCGGATC GCGCTCTACG GCAGCGAACG GGGCTGTAGC GCACCGGGCT GCGATGCCCC CGCGACGCGG ACGCAGGTGC ATCACGTCAC CGAATGGCAG CGTGGCGGGC GCACCGACAT CACCGGCCTC ACCCTTGCGT GCGATGCCCA CCACGCCAAA GTCAATCCCA CACCCAGTGG GATGGAGACC ATCGTCGTCC CCATCGGCGA CTACGCCGGG CGGATCGGAT GGCGCCGCAA CGCCGACCCC ACCGGCAAGC ACCGCATCAA CCACAACCAC CACGCCGCCG AGCTGTATTA CCAGGCCCTC GAACGCTGGC ACCGCACCCG CGAACAACTC CGCAGGCAAT GGCGAGCAGA GGATCTACGC GTCCAGTACC AGGACCGCAT CGATACCACC TACACCGATA TCGACGCGAT CCTCGATGGC CCGCACGGCC CACCCGTACT CGAAGCACTT CTGCACGAAC ACGACACCGA CAACACCTGG CGCGCAAGCG TGCCCGCAAA CATTCCTGAC GCAGCCTAG
|
Protein sequence | MTETVCTSAQ YLDALAAFEH ATEILASTDP VLLSSPEVLE ALSRLEVAAR KVPVAQHALV QVAHEQGLPG SLGYTGLKEM LVDRLRLAGA EARDRVAGAR GRAIEHLPCG VAQPRFALTL AAQRQGAISE RHALVIEQVF RSCPPSVDQP DVLEDILVTA AREVTPEDVA KVGQRAIELL DPDGVEPNPE KARRARGLEI GRQDKNLISD FGGTMSAECR ALMEAVFAKY ARPGVHNPDD PDSPAEGATA DQLHEAAERD HRSLAQRQHD ALTHALRLAV SSGELGQHRG LSCVPIITMG IDQLESEVGI ATTATGGRLS ITDAIRMAGT NPKYVLLLDL HQRPLFLGRE KRLATADQRI ALYGSERGCS APGCDAPATR TQVHHVTEWQ RGGRTDITGL TLACDAHHAK VNPTPSGMET IVVPIGDYAG RIGWRRNADP TGKHRINHNH HAAELYYQAL ERWHRTREQL RRQWRAEDLR VQYQDRIDTT YTDIDAILDG PHGPPVLEAL LHEHDTDNTW RASVPANIPD AA
|
| |