Gene Information |
Locus tag | Tpau_3745 |
Symbol | |
ID | 9157925 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 3863433 |
End bp | 3865028 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | protein of unknown function DUF222 |
Protein accession | YP_003648662 |
Protein GI | 296141419 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
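The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of gene_len bp."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    # The terminal stop codon does not contribute an amino acid.
    return codons - 1 if has_stop else codons


# Values taken from the record for Tpau_3745.
length = gene_length(3863433, 3865028)
protein = expected_protein_length(length)
print(length, protein)  # 1596 531
```

Both results match the Gene Length (1596 bp) and Protein Length (531 aa) fields. The strand field does not affect this calculation, because the length depends only on the span of the coordinates.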
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAGG CGGTGTGCAC GTCAGCGCAG TATCTGGATG CGCTCGCCGC CTTCGAACAC GCCACTGAAA TCCTTGCCTC CACGGACCCG GTGCTGCTGT CCTCCCCCGA AGTGTTGGAG GCGTTGTCGC GGCTGGAAGT TGCGGCGCGG AAGGTGCCGG TGGCGCAGCA CGCACTGGTG CAGGTCGCGC ACGAGCAGGG CCTGCCCGGC AGCCTCGGCT ACACCGGCCT CAAGGAAATG TTGATCGATC GGTTACGGCT CGCCGGCACC GAAGCCCGCG ACCGGGTTGC CGGTGCGCGA GGCCGAGCGA TCGAGCACCT CCCATCCGGT GTTGCGCAGC CGCGGTTCGC GTTGACGATG GCGGCGCAGC GGCAGGGGGC GATCAGTGAA CGGCACGCCC TGGTGATCGA GCAGGTGTTC CGGTCCTGCC CACCCAGTGT GGATCAGCCC GAGGTTCTGG AAGACATTCT GGTGACGGCA GCGCGGGAGG TCACCCCCGA AGACGTAGCC AAGGTCGGCC AGCGGGCGAT CGAGTTGCTG GACCCCGATG GAGTGGAGCC GAACCCGGAG AAAGCACGCC GCGCCCGCGG ACTGGAAATC GGGCGCCAGG ACAAGAATCT GGTCTCCGAC TTCGGCGGCA CCCTCTCCGC CGAATGCCGG GCGTTGATGG AAGCGGTATT CGCGAAATAC GCCCGCCCCG GGGTGCACAA CCCGGACGAT CCCGACAGCC CGGCTGAGGG CGCCACCGCC GACCAGTTGC ACGAGGCGGC GGAGCGGGAT CACCGCAGCC TCGCCCAACG CCAACACGAC GCGTTGACGC ACGCCCTGCG GCTCGCGGTC TCGTCGGGTG AACTCGGCCA GCACCGCGGA CTGTCATGCG TTCCGATCAT CACCATGGGC ATCGATCAGC TCGAATCCGA GGTCGGGGTC GCGACCACCG CCACCGGCGG CAGGCTGTCC ATCACAGACG CCATCCGCAT GGCGGGCACC AACCCGAAAT ACGTCCTCCT CCTCGACCTC CACCAACGAC CCCTGTTCCT CGGGCGGGAG AAACGCCTCG CCACCGCCGA TCAACGGATC GCGCTCTACG GCAGCGAACA CGGCTGCACC GCACCCGGCT GTGACGCCCC CGCGACGCGG ACGCAGGTCC ATCACGTCAC CGAATGGCAG AACGGAGGCC GCACCGACAT CACCGGCCTC ACCCTCGCCT GCGATGCCCA CCACAGCAAA GTCTCGCCGG CACTATCCGG GATGGAGACC GTCGTCGTGC CCGCAGGCGA CTACGCTGGC CGCATCGGGT GGCGCCGCAA CACCACCGCC GGCCCCCACA AGGTCAACCA CAACCACCAC GCCGCCGAGC TGTACTACCA AGCCCTCGAA CGCTGGCACC GCACCCGCGA ACAACTCCGC CAGCTATGGC GCACCCACGA CCTGCGGGAA CAGTACGAGG ACCGGATCGG CAGCACCTAC ACCGACATCG ACGCCCTCCT CGACAGCCCC CACGGCCCAC CCCTGCTCGA AACACTCCTC GCCGAACACG ACGCTGACAA TGCCTGGCGC ACACGTCCAC CCGGCGACGC ACCCCGCGCT GCCTGA
|
Protein sequence | MTEAVCTSAQ YLDALAAFEH ATEILASTDP VLLSSPEVLE ALSRLEVAAR KVPVAQHALV QVAHEQGLPG SLGYTGLKEM LIDRLRLAGT EARDRVAGAR GRAIEHLPSG VAQPRFALTM AAQRQGAISE RHALVIEQVF RSCPPSVDQP EVLEDILVTA AREVTPEDVA KVGQRAIELL DPDGVEPNPE KARRARGLEI GRQDKNLVSD FGGTLSAECR ALMEAVFAKY ARPGVHNPDD PDSPAEGATA DQLHEAAERD HRSLAQRQHD ALTHALRLAV SSGELGQHRG LSCVPIITMG IDQLESEVGV ATTATGGRLS ITDAIRMAGT NPKYVLLLDL HQRPLFLGRE KRLATADQRI ALYGSEHGCT APGCDAPATR TQVHHVTEWQ NGGRTDITGL TLACDAHHSK VSPALSGMET VVVPAGDYAG RIGWRRNTTA GPHKVNHNHH AAELYYQALE RWHRTREQLR QLWRTHDLRE QYEDRIGSTY TDIDALLDSP HGPPLLETLL AEHDADNAWR TRPPGDAPRA A
|
| |
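The sequences above are displayed in space-separated blocks of ten characters, so they need light cleanup before any analysis. The following is a minimal sketch of that cleanup, a GC-fraction calculation, and a translation of the gene sequence. It assumes that translation table 11 assigns amino acids to codons exactly as the standard genetic code does (the two differ in permitted start codons), and all function names are hypothetical helpers introduced for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; the
# codon-to-amino-acid assignments are assumed identical in table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence from the record.
fragment = clean_sequence("ATGACCGAGG CGGTGTGCAC GTCAGCGCAG")
print(translate(fragment))  # MTEAVCTSAQ
```

The translated fragment agrees with the first ten residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field, and `translate` would give a string to compare against the full protein sequence.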