Gene Tpau_1950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_1950 
Symbol 
ID9156105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp2036713 
End bp2038038 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content70% 
IMG OID 
Producthistidyl-tRNA synthetase 
Protein accessionYP_003646901 
Protein GI296139658 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGCCG GAAGCTTCCA GGCTCCCAAG GGCATCCCCG ACTACATCCC GGTCCCGTCC 
GGCGAGAAGA ACAGCTCGGC TGATTTCCTC GCGGTGCGCA CGGCGCTGCT GCGCGCTATC
GCGGATGCCG GCTACGGCTA CATCGAGCTG CCGATCTTCG AGGACACCTC CCTGTTCGCG
CGCGGCGTCG GTGAGTCGAC CGATGTGGTG GCCAAGGAGA TGTACACCTT CGCCGACCGC
GGCGACCGCT CCGTGACGCT GCGCCCGGAG GGGACGGCCG GTGTGGTCCG CGCGGTCCTG
CAACACGGCC TGGATCGGGG CCAACTGCCG GTCAAGGCGG CCTACGCCGG TCCGTTCTTC
CGCTACGAGC GTCCACAGGA GGGCCGGTAC CGGCAGCTTC AGCAGGTGGG CATCGAGGCG
ATCGGCGTGG ACGATCCCGC ACTCGATGCC GAGGTGATCG CCGTGGCCGA CCGCGCCTAT
CGCAGCGTCG GTCTCGACGG GTTCCGCCTG GAGGTCTCCA GCCTGGGTGA CGAGACCTGC
CGCCCGCAGT ACCGGGAGAA GTTGCAGGAG TTCCTGTTCG CGCTCGACCT CGATGAGGAG
ACCCGTCGGC GTGCCGAGAT CAATCCGCTC CGCGTGCTCG ACGACAAGCG GCCCGAGGTG
AAGGCCATGA CGGCCGATGC ACCCCTCATG ATCGACAACC TCACCCCGGA GCCCAAGGAG
CACTTCGAGA AGGTGCTCGG CTACCTCGAC GCGCTCGGGG TGCCCTATGT GGTGAATCCG
CGCCTGGTCC GCGGCCTCGA TTACTACACC AAGACCTGTT TCGAGTTCGT GCACGACGGT
CTCGGCGCCC AGTCGGGAAT CGGCGGCGGC GGCCGCTACG ACGGCCTGGT CGAGCAGCTG
GGCGGCCGCG AGGGCGTGAC CGGCGTCGGA TTCGGACTCG GCGTCGACCG CACGCTGCTG
GCGTTGGCCG CCGAGGGCAA GCGCGCACCG GCGGCGCCTC GCGTGGTGGC CTTCGGCGTC
CCGCTGGGCG ACGACGCCCG GGATGCGATG GTCCCGCTCC TCGGACGGCT CCGCGCGCTC
GGTGTGCCCT CCGATATGGC CTACGGCAAC CGCGCGATGA AGGGCGCGAT GAAGGCCGCC
GACCGTTCCG GTGCCCGATT CGCCCTGATC CTGGGCGATT CCGAGCTGGC CGACGGCGTG
GTGATGCTCA AGGATCTGGC GAACGGCGAG CAGCGGGCGG TGCCGCTCGA TACCGTGGCG
GGCGTGATCG CCTCCGCGAA CGACGTGAAC GGTGCCGACG GCGCGAGCGC GCGGAGCGCG
GGGTAA
 
Protein sequence
MSAGSFQAPK GIPDYIPVPS GEKNSSADFL AVRTALLRAI ADAGYGYIEL PIFEDTSLFA 
RGVGESTDVV AKEMYTFADR GDRSVTLRPE GTAGVVRAVL QHGLDRGQLP VKAAYAGPFF
RYERPQEGRY RQLQQVGIEA IGVDDPALDA EVIAVADRAY RSVGLDGFRL EVSSLGDETC
RPQYREKLQE FLFALDLDEE TRRRAEINPL RVLDDKRPEV KAMTADAPLM IDNLTPEPKE
HFEKVLGYLD ALGVPYVVNP RLVRGLDYYT KTCFEFVHDG LGAQSGIGGG GRYDGLVEQL
GGREGVTGVG FGLGVDRTLL ALAAEGKRAP AAPRVVAFGV PLGDDARDAM VPLLGRLRAL
GVPSDMAYGN RAMKGAMKAA DRSGARFALI LGDSELADGV VMLKDLANGE QRAVPLDTVA
GVIASANDVN GADGASARSA G