Gene Tpau_1971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_1971 
Symbol 
ID9156126 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp2059008 
End bp2060585 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content72% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003646922 
Protein GI296139679 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.743088 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCAGCG CCCGCCGCAC CGTCGGCCTC ATTACAGCAG TAGCACTGGC ACTCGGCGCG 
TGCTCGTCGG GCGCGGGCGG CGCGGAGGAC CGCCTGGTAC TCGCCGAGAC TCAGCCGCTG
GGCGGATTCA ACCCACTGAT GGGGTACGGC GAACTGGGCG TGTCCCCTTT GTACGAGGGC
CTGTACCGGC CGGATGCGGC GTCGGACGGG CAGGTGCCGG ATCTTCTGCC GGCACTCGCC
ACCGCGGCGC CGCAGCGGAT CGGCCCCCGC ACCTGGCGGA TCCCGCTACG GGCCGGAGTC
ACCTTCTCCG ACGGGACACC GTTCGACGCC GCTGATGTGG TTGCGACCTA CCGCGCGGCG
CGCGATCCAA AGGTGGCGGC CGACATCGCC ACCCACGTGG CGCCGGTGCA GGACGTGACC
CCGGACGGCG GCGGCGCCGT CATAGTGCGG CTCGCCACCG ACGGTGATCC GACTCCCTAT
CTGCTCCTCG GCATCGTGCC CGCCGAGCGC ATCGAGGAGC GTCCGGTCGC CCAGTGGGGC
CTTAATCGCA CGCCCGTCGG CACCGGGCCG TACCGGCTCG ACTCCCTCGC CGATGACCAG
GCCGTCCTCG TGGCCCGTTC CGATCGCGGC CCGCAGCCGG CGGTGCGCCG CGTGGTGTAC
ACGCTGGTGC CGGACGACAA TGCCCGGGCA CAACGGGTAC GGGCCGGCGA GGTCGACGGT
GCGCTGCTGC CGCCGAAGCT GGCGGCGTCC CTCGACGGCC GTGACGGCGT GCGCACCATG
ACGGTGAAAT CGGCGGACTG GCGCGGTGTC TCGCTCCCCG CCGCCAATGC GTTCACCGCC
GATCCGGTGG CGCGGCGTGC GATGAACCTG GGCGTGGATC GCGCCGCGGT GATCTCCGGG
GTGTTGGCCG GCGCCGGGGA ACCGGCGAGC ACGCCGTACT CGTCGGTGTA CGGCGCCGCA
TACGAACCGG GAGCGCAGTT CGACTTCGAC GCTGCGGAAG CGAGTCGACT GCTCGATGAG
GCAGGGTGGC TACCCGGACC GGACGGGGTG CGCACGCGCA ACGGCAGCAC CGCCGCGTTC
GGGCTGCTCT ACAACGCACA GGACACGGTG CGGCGCGACC TGGCCGTGGC CTTCGCCGCG
GCGATGAAGC CGCTCGGTAT CGCCGTCACA CCGCAGGGCA GCAGCTGGGA CGAGATCGAA
AAGCGCACCC GCGATGCCGC GATCCTGCTG GGCGGCGGCG AGACGCCGTT CAGCATCGAC
GCGCAGGGCT ACGACGCCCT GCACACCCGG GTGCCCGGCT CGTCGCCCTA CAGCAATCCC
GGAGACTTCA CCGCGCCCGG ACTCGATGAT CTGCTGGAGC GGGCCCGGAA CCTGACTCCC
GGTCCCGAGA AGGACGCCGC GTACCGGCAG GTGCAGCGCA TCTACGCCGC CCAACCCTCG
GCCGTCTACC TCGCGCACCT GCACCACGCG TACGCCGTCC GGGCCGGCGG CTGGACCTAC
GCTCCGCCGA TCCTGGAACC GCATTCGCAC GGCGTCACGT GGGGACCGTG GTGGAACCTG
CCCTCGTGGA AACGCTGA
 
Protein sequence
MRSARRTVGL ITAVALALGA CSSGAGGAED RLVLAETQPL GGFNPLMGYG ELGVSPLYEG 
LYRPDAASDG QVPDLLPALA TAAPQRIGPR TWRIPLRAGV TFSDGTPFDA ADVVATYRAA
RDPKVAADIA THVAPVQDVT PDGGGAVIVR LATDGDPTPY LLLGIVPAER IEERPVAQWG
LNRTPVGTGP YRLDSLADDQ AVLVARSDRG PQPAVRRVVY TLVPDDNARA QRVRAGEVDG
ALLPPKLAAS LDGRDGVRTM TVKSADWRGV SLPAANAFTA DPVARRAMNL GVDRAAVISG
VLAGAGEPAS TPYSSVYGAA YEPGAQFDFD AAEASRLLDE AGWLPGPDGV RTRNGSTAAF
GLLYNAQDTV RRDLAVAFAA AMKPLGIAVT PQGSSWDEIE KRTRDAAILL GGGETPFSID
AQGYDALHTR VPGSSPYSNP GDFTAPGLDD LLERARNLTP GPEKDAAYRQ VQRIYAAQPS
AVYLAHLHHA YAVRAGGWTY APPILEPHSH GVTWGPWWNL PSWKR