Gene Tpau_3621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_3621 
Symbol 
ID9157800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp3735400 
End bp3737490 
Gene Length2091 bp 
Protein Length696 aa 
Translation table11 
GC content67% 
IMG OID 
ProductOligopeptidase B 
Protein accessionYP_003648538 
Protein GI296141295 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGACG CCGTCCAGCC CGTCGCCAAG AAGGTTCCCA GTGAACGGAC CTTCCACGGC 
GACACCGTGA CCGACGACTA CGCCTGGCTC GCCGACACCT CGAACGAGGA GGTGCTCGAC
TACCTCCACC GGCACAATGC CTACACCGAG GCTGCGACGG CGACGCAGGA GCCGCTGCGC
CAGAAGATCT TCAATGAGAT CAAAGCGCAC ACCCAAGAGA CCGATATGTC GGTCCCGCAA
CGTCGCGGTG GCTATTGGTA CTACGCCCGC ACCAAGGAAG GCGCCCAGTA CGGCATTCAC
TGTCGTGCAC CGATCACCGG ACCCGACGAC TGGTCGCCGC CGATCCTCTC CGATCAGCCG
CTCCCCGGCG AGCAGGTGGT GATCGACCTC AACGTGGAGG CCGAGGGGCA TGAGTACATC
GCCCTCGGTG CCGCGTCCGT CTCCACCGAC GGCGAGTTGC TGGCGTACTC GCTGGACACT
TCCGGTGACG AGCGATTCAC CTTGCGCGTG CGCAACATCG GCACCGGTGA GGTACTACCC
GATGTGGTCG AGGGCGTCTT CTACGGCGCC ACGTGGGCAC CCGATTCGCG GCACCTCTTC
TACACCACCG TTGACGATGC CTGGCGCGCC GACAGTATCT GGCGCCACGA GATCGGTTCC
GGTGCAGAGG ATGTACGCGT CTTCCACGAG ACCGATCAAC GGTTCGGTGT GGGCGTCGGG
CTCACCCGCA GCGAGCGCTA TCTGATGATC GCCGCATCGT CGACGTTGAG CTCCGAGACC
TGGGTCCTCG AGGCGACCGA TCCGACCGGC GAGTTCCGCG TGCTCATCCC GCGACAAGAA
GACGTCGAGT ACTCCGCGGA GCATGCCGTG CTCGACGGCG AAGACCGGTT CCTCCTGCTG
CACAACCGCA CCGGCATCAA CTTCGAGCTG GTCTCCGCGC CGGTCGACGC ACCCGAGGAC
TGGACCGTCG TCGTGCCGCA CCGAGACGAC GTGCGCCTCG AGTACGTCGA CGCCTACGCG
CGCACCCTGG CGCTCGGCTA CCGCCGCGCA GGCCTGCCCC GCCTCGCGCT CGCCGAAGCC
ACGACAGCAC CGTCGTTCAC CGAATGGGAT CCGGGCGAAC CGCTCGCCAA CGTGGGCCCC
GCGGCGAACC CCGAATGGGA CGCGCCGCGC CTGCGCCTGG CGTACGAATC GTTCGTGACG
CCGGGCACGG TCTTAGAACT CGACGCCGCG ACCGGGGCGT CGACGGTGCT CAAGCGCGTC
AACGTGCCGG GCTACGACTC CGCGCTGTAC ACCGCGGAAC AGATCTGGGT GAGTGCGCGC
GACGGCGCCG AGGTCCCGGT CTCGGTGGTG CATCGCAAGG ACATCCCCGC GGGCGCCGCT
CCGACCCTGC TCTACGGCTA CGGCTCCTAC GAGGCCACCC TGGATCCGTG GTTCTCCGTG
GCCCGGCTCT CGCTCATGGA TCGCGGCGTC GTGTTCGCCG TCGCCCACAT CCGCGGCGGG
GGCGAAATGG GGCGCGCCTG GTACGAGCAC GGCAAGCAGC TGGAGAAGAC CAACACCTTC
ACGGATTTCG TGGATGTGGC TCGGCATCTC GTCGACGCGG GGCGCGCGGC CCCCTCGAAG
CTGGTCGCCA TGGGCGGCAG CGCGGGCGGT CTTCTGGTGG GTGCCGTCGC GAACCTCGCG
CCGGAATTGT TCTGCGGCAT CGTCGCCGAC GTTCCCTTCG TGGACCCGCT CACCTCGATC
CTCGATCCGT CGCTCCCGCT GACCGTGGGG GAGTGGGACG AATGGGGTAA TCCGCTGGAG
AGCGCCGAGG TGTACCGCTA CATGAAGGCC TACTCGCCGT ACGAGAACGT CGAGGCCAAG
GCCTATCCCG CGCTGCTGGT GACCACCTCG CTCAATGACA CCCGCGTACT CCCCACCGAG
CCGGCGAAAT GGGTCGCGAA ACTCCTCGAT CACACCACCT CCGGTGAGCA GATCCTGCTC
AAGACCGAGA TGGTCGCCGG CCATGCGGGC GTCAGCGGGA GGTACGCCAA GTGGCGGGAG
ACCGCGTTCG AGTACGCCTG GGTCCTCGAC AGGCTGGGTG CGGCCCAGTA G
 
Protein sequence
MTDAVQPVAK KVPSERTFHG DTVTDDYAWL ADTSNEEVLD YLHRHNAYTE AATATQEPLR 
QKIFNEIKAH TQETDMSVPQ RRGGYWYYAR TKEGAQYGIH CRAPITGPDD WSPPILSDQP
LPGEQVVIDL NVEAEGHEYI ALGAASVSTD GELLAYSLDT SGDERFTLRV RNIGTGEVLP
DVVEGVFYGA TWAPDSRHLF YTTVDDAWRA DSIWRHEIGS GAEDVRVFHE TDQRFGVGVG
LTRSERYLMI AASSTLSSET WVLEATDPTG EFRVLIPRQE DVEYSAEHAV LDGEDRFLLL
HNRTGINFEL VSAPVDAPED WTVVVPHRDD VRLEYVDAYA RTLALGYRRA GLPRLALAEA
TTAPSFTEWD PGEPLANVGP AANPEWDAPR LRLAYESFVT PGTVLELDAA TGASTVLKRV
NVPGYDSALY TAEQIWVSAR DGAEVPVSVV HRKDIPAGAA PTLLYGYGSY EATLDPWFSV
ARLSLMDRGV VFAVAHIRGG GEMGRAWYEH GKQLEKTNTF TDFVDVARHL VDAGRAAPSK
LVAMGGSAGG LLVGAVANLA PELFCGIVAD VPFVDPLTSI LDPSLPLTVG EWDEWGNPLE
SAEVYRYMKA YSPYENVEAK AYPALLVTTS LNDTRVLPTE PAKWVAKLLD HTTSGEQILL
KTEMVAGHAG VSGRYAKWRE TAFEYAWVLD RLGAAQ