Gene Information |
Locus tag | Tpau_1971 |
Symbol | |
ID | 9156126 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 2059008 |
End bp | 2060585 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003646922 |
Protein GI | 296139679 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
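The length fields above are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the CDS is assumed to end in a single stop codon. Both are assumptions about this record's conventions rather than something the record states. A minimal Python sketch of that arithmetic (the function names are hypothetical):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2059008, 2060585)
print(length)                     # 1578
print(protein_length_aa(length))  # 525
```

The strand does not enter into this calculation: a gene on the "-" strand spans the same number of base pairs, only its sequence is read as the reverse complement of the replicon.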
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.743088 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCAGCG CCCGCCGCAC CGTCGGCCTC ATTACAGCAG TAGCACTGGC ACTCGGCGCG TGCTCGTCGG GCGCGGGCGG CGCGGAGGAC CGCCTGGTAC TCGCCGAGAC TCAGCCGCTG GGCGGATTCA ACCCACTGAT GGGGTACGGC GAACTGGGCG TGTCCCCTTT GTACGAGGGC CTGTACCGGC CGGATGCGGC GTCGGACGGG CAGGTGCCGG ATCTTCTGCC GGCACTCGCC ACCGCGGCGC CGCAGCGGAT CGGCCCCCGC ACCTGGCGGA TCCCGCTACG GGCCGGAGTC ACCTTCTCCG ACGGGACACC GTTCGACGCC GCTGATGTGG TTGCGACCTA CCGCGCGGCG CGCGATCCAA AGGTGGCGGC CGACATCGCC ACCCACGTGG CGCCGGTGCA GGACGTGACC CCGGACGGCG GCGGCGCCGT CATAGTGCGG CTCGCCACCG ACGGTGATCC GACTCCCTAT CTGCTCCTCG GCATCGTGCC CGCCGAGCGC ATCGAGGAGC GTCCGGTCGC CCAGTGGGGC CTTAATCGCA CGCCCGTCGG CACCGGGCCG TACCGGCTCG ACTCCCTCGC CGATGACCAG GCCGTCCTCG TGGCCCGTTC CGATCGCGGC CCGCAGCCGG CGGTGCGCCG CGTGGTGTAC ACGCTGGTGC CGGACGACAA TGCCCGGGCA CAACGGGTAC GGGCCGGCGA GGTCGACGGT GCGCTGCTGC CGCCGAAGCT GGCGGCGTCC CTCGACGGCC GTGACGGCGT GCGCACCATG ACGGTGAAAT CGGCGGACTG GCGCGGTGTC TCGCTCCCCG CCGCCAATGC GTTCACCGCC GATCCGGTGG CGCGGCGTGC GATGAACCTG GGCGTGGATC GCGCCGCGGT GATCTCCGGG GTGTTGGCCG GCGCCGGGGA ACCGGCGAGC ACGCCGTACT CGTCGGTGTA CGGCGCCGCA TACGAACCGG GAGCGCAGTT CGACTTCGAC GCTGCGGAAG CGAGTCGACT GCTCGATGAG GCAGGGTGGC TACCCGGACC GGACGGGGTG CGCACGCGCA ACGGCAGCAC CGCCGCGTTC GGGCTGCTCT ACAACGCACA GGACACGGTG CGGCGCGACC TGGCCGTGGC CTTCGCCGCG GCGATGAAGC CGCTCGGTAT CGCCGTCACA CCGCAGGGCA GCAGCTGGGA CGAGATCGAA AAGCGCACCC GCGATGCCGC GATCCTGCTG GGCGGCGGCG AGACGCCGTT CAGCATCGAC GCGCAGGGCT ACGACGCCCT GCACACCCGG GTGCCCGGCT CGTCGCCCTA CAGCAATCCC GGAGACTTCA CCGCGCCCGG ACTCGATGAT CTGCTGGAGC GGGCCCGGAA CCTGACTCCC GGTCCCGAGA AGGACGCCGC GTACCGGCAG GTGCAGCGCA TCTACGCCGC CCAACCCTCG GCCGTCTACC TCGCGCACCT GCACCACGCG TACGCCGTCC GGGCCGGCGG CTGGACCTAC GCTCCGCCGA TCCTGGAACC GCATTCGCAC GGCGTCACGT GGGGACCGTG GTGGAACCTG CCCTCGTGGA AACGCTGA
|
Protein sequence | MRSARRTVGL ITAVALALGA CSSGAGGAED RLVLAETQPL GGFNPLMGYG ELGVSPLYEG LYRPDAASDG QVPDLLPALA TAAPQRIGPR TWRIPLRAGV TFSDGTPFDA ADVVATYRAA RDPKVAADIA THVAPVQDVT PDGGGAVIVR LATDGDPTPY LLLGIVPAER IEERPVAQWG LNRTPVGTGP YRLDSLADDQ AVLVARSDRG PQPAVRRVVY TLVPDDNARA QRVRAGEVDG ALLPPKLAAS LDGRDGVRTM TVKSADWRGV SLPAANAFTA DPVARRAMNL GVDRAAVISG VLAGAGEPAS TPYSSVYGAA YEPGAQFDFD AAEASRLLDE AGWLPGPDGV RTRNGSTAAF GLLYNAQDTV RRDLAVAFAA AMKPLGIAVT PQGSSWDEIE KRTRDAAILL GGGETPFSID AQGYDALHTR VPGSSPYSNP GDFTAPGLDD LLERARNLTP GPEKDAAYRQ VQRIYAAQPS AVYLAHLHHA YAVRAGGWTY APPILEPHSH GVTWGPWWNL PSWKR
|
| |
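The sequences above are printed in space-separated blocks of 10 characters. The following is a minimal Python sketch of removing that layout and computing a GC fraction, which is one way the GC content field could be checked against the gene sequence. The helper names are hypothetical, and the sample input is only the first two blocks of the gene sequence, so its result is not the whole-gene value.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between blocks and normalise to upper case."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, used only as a small illustration.
sample = "ATGCGCAGCG CCCGCCGCAC"
print(len(clean_sequence(sample)))  # 20
print(gc_fraction(sample))          # 0.8
```

Passing the full Gene sequence field to `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the GC content field.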