Gene Tpau_1001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_1001 
Symbol 
ID9155141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp1025531 
End bp1026814 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content64% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003645973 
Protein GI296138730 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.628257 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGACCA CGGGAGCGGC GGGACTCAGC CGCCGGAGCG TGCTGAAAGG CGCACTCGCT 
GCGTCGACGT TGGCCGCACT CGCGCCGACC CTCGCATCAT GCGGTCGCTC GGACTCACCA
CGCCCCCTCC ACCTGGGGTC GCGGAACTCG GACTCGAAGG CCAAAGCAGG TATGGCGGCG
CTCGTGGGCG CATTCACCGA ACGTACGTCG ATTCCGGTAT CTGTGAACGT GGTCGATACG
GTCTCGTTCC AGGAGAACCT CAACAACTAT CTCCAGGGTG CGCCGGACGA CGTGTTCACC
TGGATGTCGG GTTACCGGAT GCGCTACATC GCCAATAAGA AGCTGGTCTC TCCGTTGGAT
GCGGTCTGGC CCGCCATCGA CCGTGGCTTC GACAGTTCGT TCAAGGGCGC GGCGACCGAC
AACGGTCACG CCTACCTCGT GCCCCTGACG TACTACCCGT GGGCGATCTA CTACCGCAAG
AGTGTGTGGC AACGCTATGG CTACCAGCCG CCGCGCACCC ACGCCGACTT CATCGATCTG
TGCAAGCGGA TGAAGGCGGA CGGCATCGTT CCGCTGGGAT TCGCCGCGCG GCAGGGGTGG
ACCACGTTCG GCATGTTCGA CTACCTCAAT CTGCGGATCA ACGGTCCCGA TTTCCACCGC
AGCCTGCTGG CCGGCGAGAT CTCCTGGACC GACAACCGGG TCTACGCGAC CTTCGACGCC
TGGCGCGAAT TCCTACCGTT CCAGCAGGAA CAACCGCTGG GGCGCACCAT CGCGGAGGCG
CACACCGCAC TGCTCAACCG CAAGGTGGGA ATGATGGTCA CGGGATTGTT CGTTTCGGAG
CAGTTCCCCG CGGGCCCCGA CCTCGACGAC CTGGACTTCT GCCCGTTCCC GGAGTTCGAC
AGCGCCATCG GCGCCGGAGC GGTCGAGGCA CCCCTCGACG GACTCATGAT GAGTGCGAAC
CCCCGGAACC GGGAGGCCGC CACGCAGTTC CTCGAATACG CCGCCTCGGC GGAAGCGGGC
ATCAAGTACA GCAGCGGGCT CGCGATGTCG ATTCCTGCTC ACAAGGACAT CTCGCTCGCC
GACTACTCGC CGCTCATCCA GAAGGCAGCC GCGCTGGTGA AGAACAGCAA TTCGATCACG
CAGTTCCTCG ACCGGGACAC ACGACCGGAC TTCTCATCGA TCGTGATCAT CCCTTCATTG
CAACAGTTCA TCCGGGCCCC ACAGGATCTC CGTTCGGTGC TCGCGAGCAT CGAGAAGCAG
AAGAAGGCGG TGTTCGACCG ATGA
 
Protein sequence
METTGAAGLS RRSVLKGALA ASTLAALAPT LASCGRSDSP RPLHLGSRNS DSKAKAGMAA 
LVGAFTERTS IPVSVNVVDT VSFQENLNNY LQGAPDDVFT WMSGYRMRYI ANKKLVSPLD
AVWPAIDRGF DSSFKGAATD NGHAYLVPLT YYPWAIYYRK SVWQRYGYQP PRTHADFIDL
CKRMKADGIV PLGFAARQGW TTFGMFDYLN LRINGPDFHR SLLAGEISWT DNRVYATFDA
WREFLPFQQE QPLGRTIAEA HTALLNRKVG MMVTGLFVSE QFPAGPDLDD LDFCPFPEFD
SAIGAGAVEA PLDGLMMSAN PRNREAATQF LEYAASAEAG IKYSSGLAMS IPAHKDISLA
DYSPLIQKAA ALVKNSNSIT QFLDRDTRPD FSSIVIIPSL QQFIRAPQDL RSVLASIEKQ
KKAVFDR