Gene Tpau_0804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_0804 
Symbol 
ID9154944 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp817763 
End bp818932 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content71% 
IMG OID 
Productaminotransferase class I and II 
Protein accessionYP_003645779 
Protein GI296138536 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCACCG TCCACCGCCT CCGGCCCTTC ATGTCCACGA TCTTCGCCGA GGTCTCGACG 
CTCGCGGTGG AGCACGATGC GGTCAACCTG GGTCAGGGGT TCCCCGACAC CGACGGCCCC
GCCGGCATGC TCGCCGCGGC GAAGACGGCC ATCGACGGCG GGATGAACCA ATACCCGTCC
GGCGACGGCC TTCCGGAGCT GCGCCGCGCC GTCGCCGCCC AGGTGCAGCG CGACTACGGC
CTGGGCTACG ACCCGAACGG CGAGGTATTG GTCACCGTCG GCGCGTCCGA GGGGATCGCG
GCCGCCGTGC TGGGCCTCGT CGAACCCGGC CGCGAGGTGA TCCTGATCGA GCCCTTCTAC
GATTCCTACG CCGCGACCGT GGCGATGGCC GGCGCACACC GCGTGGTCGT GCCGCTCGTC
GAAGACGCCG GACGGTACGC CCTCGACCCC GCGATGCTCC GGGCGGCCGT CACCGAGAAG
ACGGCGGCGA TCATCGTCAA CTCCCCGCAC AATCCCACCG GCACGGTGCT CTCGCACGAA
GACCTGCAGC TCGTCGCCGC TGTGTGCGTC GAACGCGATC TGCTCTGCTT CACCGACGAG
GTCTACGAGC ATCTGCTCTT CGACGGCCGC GTGCACACAC CGCTCGCGAC GTTCGAGGGC
ATGCGCGAGC GCACCGTGCG GATCTCCGGT GCCGCGAAGT CCTTCAACGT GACCGGCTGG
AAGGTCGGCT GGATCACGGC GCCGCGCGAG CTCGCGGACG CCTGCCGGGC GGCCAAGCAG
TGGCTCACCT TCACCGGAGC GGCGCCGTTG CAGCACGCGG TGGCCCATGC TCTCGACTCC
GAGGGCGCGT GGCTGGCCCA GCTCGCACCC GATCTGCAGG CCAAGCGCGA CCTGCTCACC
TCGGCGTTGC ACGAGACCGG ATTCACGGTG CATCCGGCGG AGGGCACGTA CTTCGTGTGC
GCCGACGCCC GGGGGCTCGG CTACGACGAT GCGGGCGCGC TGTGCCGCGA GATGCCCGGG
CGGATCGGGG TGGCCGCGGT GCCCGTGAGC GCACTGGCCG ACGACCACGC CCGCTGGGGG
CACCTGCTGC GGTTCGCATT CAGCAAACAG GCCGATGTGC TGGCCGAGGG AACCCGCCGC
CTGGCCGCAC TCGGCGGCCG GACGCGGTGA
 
Protein sequence
MRTVHRLRPF MSTIFAEVST LAVEHDAVNL GQGFPDTDGP AGMLAAAKTA IDGGMNQYPS 
GDGLPELRRA VAAQVQRDYG LGYDPNGEVL VTVGASEGIA AAVLGLVEPG REVILIEPFY
DSYAATVAMA GAHRVVVPLV EDAGRYALDP AMLRAAVTEK TAAIIVNSPH NPTGTVLSHE
DLQLVAAVCV ERDLLCFTDE VYEHLLFDGR VHTPLATFEG MRERTVRISG AAKSFNVTGW
KVGWITAPRE LADACRAAKQ WLTFTGAAPL QHAVAHALDS EGAWLAQLAP DLQAKRDLLT
SALHETGFTV HPAEGTYFVC ADARGLGYDD AGALCREMPG RIGVAAVPVS ALADDHARWG
HLLRFAFSKQ ADVLAEGTRR LAALGGRTR