Gene Information |
Locus tag | Tpau_4019 |
Symbol | |
ID | 9158201 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 4147617 |
End bp | 4148852 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | protein of unknown function DUF1212 |
Protein accession | YP_003648929 |
Protein GI | 296141686 |
COG category | |
COG ID | |
TIGRFAM ID | |
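The coordinate and length fields above are consistent with one another, and a short check makes the arithmetic explicit. This is only an illustrative sketch: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the stop codon. The function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4147617, 4148852)
print(length)                  # 1236
print(protein_length(length))  # 411
```

Both values match the Gene Length and Protein Length rows of the record.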
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.959569 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACGCCTG ACCGGGTACC CGATGCCGGG GCCGAGCACC GCCGGCTCCT CGCTTATCTG GGCGCCGCCA TGATCGCCGG CGGCCAACCC GTGCACGAGG TGGAGGACGA ACTGCGCGCC GTCGCCACCG CACTGGGGCA CCCGATGACG CAGGTCGGGG CTACACCCAC CGGACTCACC GTGGGCCTGG CCCCCGGCGA GCCGGCCACC TTCGAATCGA TCGACGGTCC GCTCCGGCTG GAACAATCCG CCGTGGTCGA CGATGTCCGA CGGGGCCTCG TCGCCGGCAC CACCGCACCC GACGAGGCCT TCGGCCGGCT CGGCTCGCTG CGCTCGCTGC CGCACCGTTA CTCCCGGTGG GTGCAGGACG CTTCCTGGTC GGTGATCGCG ATCGGCATCG CGATGCTGCT GCAGCCCGGC TGGGCGAACA TCGCCGCGGC GGCGATCGGC GGAGTGGTGG TGATGCTGTT GGTGACACTG TCCGGGCGGG TGCAGTTCGT ATCCACCCTG CTGCCCACGA TCGCGGCCTT CACCGTCTCG ACCGGCGTCT TCGCCGCGGC CCAGGCCGGC TGGCTCGACG GTCCGCTGCG CACCCTGCTC GCGCCGCTGG CGGTGCTGCT CCCCGGTGCG CTGATGGTGA CCGGCATGTC GGAACTGGCC GCGGGCGCGA TGGTCGCCGG CACCGCACGG CTCACCTTCG GCATCGTCGC GCTGATGCTG TTCTCGATCG GCGTGTTCAG CGCCACCACG ATGCTGAACG TGGCGCCGGA ACTGATGATC AACCTGCGCG TCAACGAACT CGGCTGGTGG GGGCCGCCGG TGGGTCTGCT GCTCATCTGC ATCGGGGTGT GTCTCAACGA GGGCGCGTCG ATGCGGCTGC TGCCCCCGAT CGTCTGCGTG GTGGCCGCGG CCTTCGGTGC CCAGCTCGCC GGGCAGGAGC TCTCGGGTGC GGTGCTCGGC GGCTTCCTGG GTGCCGTTGC GGGGAGCCTC GGCGCCTCGA TCGCCGAATC CGTCCGCCCC GACCTACCGC GGCTCGTGGT GTTCCTGCCC GCGTTCTGGG TGCTGGTGCC CGGCAGCCTG GGCTTGTTGT CGGTGACCTC GGTGGGCCTC GATCCGGCGC AGGGTGCACG CACCGCGGTC GATGTGGCCG CCGTGATCTG CGCGCTCGCT TTGGGGCTTT TGTTCGGTTC CGCGCTGGCT CAGGCGTTCG CCCGGCGGCG TGATTCCCGG CGCTGA
|
Protein sequence | MTPDRVPDAG AEHRRLLAYL GAAMIAGGQP VHEVEDELRA VATALGHPMT QVGATPTGLT VGLAPGEPAT FESIDGPLRL EQSAVVDDVR RGLVAGTTAP DEAFGRLGSL RSLPHRYSRW VQDASWSVIA IGIAMLLQPG WANIAAAAIG GVVVMLLVTL SGRVQFVSTL LPTIAAFTVS TGVFAAAQAG WLDGPLRTLL APLAVLLPGA LMVTGMSELA AGAMVAGTAR LTFGIVALML FSIGVFSATT MLNVAPELMI NLRVNELGWW GPPVGLLLIC IGVCLNEGAS MRLLPPIVCV VAAAFGAQLA GQELSGAVLG GFLGAVAGSL GASIAESVRP DLPRLVVFLP AFWVLVPGSL GLLSVTSVGL DPAQGARTAV DVAAVICALA LGLLFGSALA QAFARRRDSR R
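As a rough cross-check of the two sequences, the sketch below translates the first and last codons of the gene sequence using the codon assignments of translation table 11, and computes GC content for a short prefix. It is an illustration under stated assumptions, not the pipeline that produced this record: the Gene sequence field is taken to be the coding strand in 5'-to-3' order, the initiator codon GTG is read as methionine, and the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for translation table 11, listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str, is_cds_start: bool = True) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Assumption: when the fragment begins at the start of the CDS, the
    initiator codon (here GTG) is reported as M rather than V.
    """
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    if is_cds_start and residues:
        residues[0] = "M"
    return "".join(residues)


# First two blocks of the gene sequence (6 complete codons).
print(translate("GTGACGCCTG ACCGGGTACC"))                   # MTPDRV
# Last 18 bases of the gene sequence, ending in the TGA stop codon.
print(translate("CGTGATTCCCGGCGCTGA", is_cds_start=False))  # RDSRR
# GC content of the first 10-base block only.
print(gc_fraction("GTGACGCCTG"))                            # 0.7
```

The two translated fragments agree with the beginning (MTPDRV) and end (RDSRR) of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content row.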