Gene Information |
Locus tag | Tpau_2233 |
Symbol | |
ID | 9156389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 2325137 |
End bp | 2326234 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | arsenical-resistance protein |
Protein accession | YP_003647181 |
Protein GI | 296139938 |
COG category | |
COG ID | |
TIGRFAM ID | |
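The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The record does not state either convention. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(2325137, 2326234)
print(length, protein_length_aa(length))  # prints "1098 365"
```

Under these assumptions the result matches the listed Gene Length (1098 bp) and Protein Length (365 aa).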
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACATCGC CCGCCGCAGG ACCCCGGGTC GTGGGGAAGT TGTCGCCCCT GGATCGGTTT CTGCCCCTCT GGATCGGCGC CGCCATGCTC ACGGGCCTGC TGCTCGGCAG GATGATCCCG GGGCTGAACT CCGCACTGGA GCGGATCCAG CTCGACGGTG TCTCTTTACC GATCGCCCTA GGGTTGTTGG TCATGATGTA CCCGGTGCTC GCGAAGGTTC GTTACGACCG GCTCGGCACC GTCACCTGCG ACCGCACACT GCTCCTCGCC TCCCTGGTGC TGAATTGGAT CATCGGGCCG GCGCTCATGT TCGCGCTCGC GTGGCTATTC CTACCGGACC TTCCTGAGTA CCGCACCGGC CTCATCATCG TGGGCCTGGC GCGGTGCATC GCCATGGTGA TCATCTGGAA CGACCTCGCC TGCGGCGACC GCGAGGCTGC CGCCGTTCTC GTCGCGCTGA ACTCAGTGTT CCAAGTGGTC ATGTTCGCGG TGCTCGGATG GTTCTATCTC ACTGTGCTGC CCGGCTGGCT CGGGCTGGAG CAGGCGGCGA TCGATGCGTC GATGTGGCAG ATCGCCGCGT CGGTGATCGT CTTCCTCGGT ATCCCGTTGG CGGCGGGCTA TCTCTCCCGC CGGATCGGGG AAAAGCGCAA AGGGCGAGAT TGGTACGAAT CGGTGTACCT GCCCATCGTC GGCCCGTGGG CGCTGTACGG GCTGCTGTTC ACCATCGTCG TGTTGTTCGC CTTGCAGGGC GATCAGATCA CTTCGCGTCC TTGGGATGTC GCGAGGATCG CGATTCCGCT GCTGGTGTAC TTCGGCATCA TGTGGGGTGG TGGCTATCTG CTCGGCGCTG CGATGGGCTT GGGCTACGAA CGCACGACCA CAGTGGCATT CACTGCTGCG GGCAACAACT TCGAGTTGGC GATCGCCGTC GCGATCGCAA CGTACGGTGC CACCTCGGGG CAGGCGCTGG CGGGAGTCGT CGGTCCCCTG ATCGAGGTTC CGGTGCTCGT AGGCCTGGTG TACGTCTCGC TGGCGTTACG ACGGAGATTC ACGATCGCCT CGCCGGTCGC GGAGACCGGT GTGAAGGGGG CACACTAA
|
Protein sequence | MTSPAAGPRV VGKLSPLDRF LPLWIGAAML TGLLLGRMIP GLNSALERIQ LDGVSLPIAL GLLVMMYPVL AKVRYDRLGT VTCDRTLLLA SLVLNWIIGP ALMFALAWLF LPDLPEYRTG LIIVGLARCI AMVIIWNDLA CGDREAAAVL VALNSVFQVV MFAVLGWFYL TVLPGWLGLE QAAIDASMWQ IAASVIVFLG IPLAAGYLSR RIGEKRKGRD WYESVYLPIV GPWALYGLLF TIVVLFALQG DQITSRPWDV ARIAIPLLVY FGIMWGGGYL LGAAMGLGYE RTTTVAFTAA GNNFELAIAV AIATYGATSG QALAGVVGPL IEVPVLVGLV YVSLALRRRF TIASPVAETG VKGAH
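Both sequences are displayed in space-separated blocks of ten characters, so the spaces have to be removed before any analysis. The gene sequence appears to be given in the coding orientation, since it begins with GTG (a permitted start codon under translation table 11) and ends with TAA. The sketch below shows one way to clean a pasted sequence and compute its GC fraction. The helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two ten-base blocks of the Gene sequence field, used as a small demo.
head = "GTGACATCGC CCGCCGCAGG"
print(clean_sequence(head))
print(gc_fraction(head))
```

Pasting the complete Gene sequence field into `gc_fraction` should give a value comparable to the GC content field, assuming that field was computed over the same span.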