Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_2318 |
Symbol | |
ID | 9156474 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 2410368 |
End bp | 2411441 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | protein of unknown function DUF21 |
Protein accession | YP_003647264 |
Protein GI | 296140021 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGACC TCCTGGCCGT CGGCCTCACC GTGCTGTTGC TCGCCGCCAA TGCGTTCTTC GTGGCTGCCG AGTTCGCGCT CATCGCCGCC CGCCGCGACC GGCTCGAGGC GCTGGTCGAC CAGGGGCGCG ACAGCGCCCG AGCGGTGATC AAGGCCTCGG AGAACCTCTC GCTCATGCTG GCGGCCTGCC AGCTCGGCAT CACCATCTGC TCGATCCTGC TGGGCAAGGT GGGTGAACCC GCCATCGCTC ATCTGTTGGA GAAGCCGATG GAGTGGGCGA ACGTGCCGGA GGCGCTGCTG CATCCGATCT CGTTCACCAT CTCGCTCAGC CTGGTGGTGG TGCTGCACAT CCTGCTCGGC GAGATGGTGC CGAAGAACAT CGCACTGGCC GGCCCGGAGT CGGCGGCCAT GTTGCTGGTT CCGCCGCTCG TCGCCTTCTC GAAGCTGATG CGCCCGGTGA TCGCGGTGTA CAACTGGCTC GCGACGCTGG TGCTGCGTGC ATTGGGCGTG GAACCTCGCG ACGAATTGGA GGGCACGATC TCGGCGGGCG AACTGTCCGA GCTCATCGCC GAATCGCACG ACGAGGGCCT GATCGACGCG GAGGAACAGC TGCGACTTAC CCGCGCGCTC CAGACCTCGC GACGCACCGT CGCCGATGTG GCGATCCCGC TCGCGCAGAT CCGCGGGCTG CAGGCCGTTG CGGGGCCGGG TGGCACCTAC GGACCCACCC TCGGCGATAT CGAATCCGCT GTCGCCGAGA CGGGCTTCTC GCGGTACCCG GTGCGCAGCC GAGGCGGCGC TTTCACGGGC TACCTGCACG TCAAGGATGT CTTGCCCGAC ATCATGGACG CCTCGGTCGG GCCGGACACC GTGATCGGCG CCGGCCGCAT CCGACCGCTC CCGGTGGTCG ACGGCGCGCT CTCGCTGGAC CAGGCCACCT CGCATCTGCG TCGTCTCGGT GGGCATCTCG CCGCCGTCGC CGACGATCAC GGTCGCACGA TCGGCATCGT CGCCCTGGAG GACGTGGTCG AGGAGTTCGT GGGCACCGTG CGCGATGGAA CGCACCGGGT GTGA
|
Protein sequence | MNDLLAVGLT VLLLAANAFF VAAEFALIAA RRDRLEALVD QGRDSARAVI KASENLSLML AACQLGITIC SILLGKVGEP AIAHLLEKPM EWANVPEALL HPISFTISLS LVVVLHILLG EMVPKNIALA GPESAAMLLV PPLVAFSKLM RPVIAVYNWL ATLVLRALGV EPRDELEGTI SAGELSELIA ESHDEGLIDA EEQLRLTRAL QTSRRTVADV AIPLAQIRGL QAVAGPGGTY GPTLGDIESA VAETGFSRYP VRSRGGAFTG YLHVKDVLPD IMDASVGPDT VIGAGRIRPL PVVDGALSLD QATSHLRRLG GHLAAVADDH GRTIGIVALE DVVEEFVGTV RDGTHRV
|
| |