Gene Tpau_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_2044 
Symbol 
ID9156199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp2128832 
End bp2130397 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content70% 
IMG OID 
Productanthranilate synthase component I 
Protein accessionYP_003646995 
Protein GI296139752 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.874446 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTACGC AATCGGCTTC GAACCTCCAG CCCGACGGCG CCACCACCAG CCTCGATGAG 
TTCTTGGAGC TCGCCGAACG GCATCGCGTG GTACCCGTGA CCCGCACCGT GCTCGCCGAT
GCCGAGACCC CGCTCTCGGC GTACCGCAAA CTCGCGGGCG GACGCCCCGG CACCTTCCTG
CTGGAATCCG CGGCGGCCGG CCAATCCTGG AGTCGATGGT CGTTCGTCGG TGCGCCCGCT
CCGGCGGCGC TGACGGTGGT CGACGGTGAA GCGGCGTGGA TCGGCACCAA GCCGGCGGGC
GCGCCATCGG GCGGCGATCC GATCGACGCG CTCGGCGAGG TGCTGCGCCT GCTCAGCTCA
GAGCCGATCG AGGGGCTTCC GCCGCTCAAC GGCGGGATGG TGGGCTACCT CGCGTACGAC
GCCGTGCGCC GGTTGGAGCG TCTGCCCGAG ACCACGGTCG ACGATCTCGG CCTGCCGGAG
ATGGTGTGGC TGCTCGCGAC CGATCTCGCC GCGATCGACC ACCACGAGGG CACCATCACC
CTCATCGCCA ACGCGGTGAA CTGGGACGGC GCCCGCGAAC GTGCGGCCGA GGCCTATGCC
GACGCGGTCA CCCGGCTCGA TGCGATGGAG ACCGCACTGG CGGCGCCTCT GGCCTCGTCG
GTGGCCTCGT TCACCCGGCC CGAGCCCGCC TACAGCGCGC AACGGACGGT CGAGGAGTAC
AGCGCGATCG TTGAGAAGCT GGTCGGCGAC ATCGAGGCCG GCGAGGCCTT CCAGGTGGTG
CCGTCGCAGC GCTTCTCGGT GCCGTCCCAC GCCGATCCGA TCGACGTCTA CCGCGTGCTG
CGGGCATCGA ACCCGAGTCC CTACATGTAT CTGGTCCAGG TACCGGCGCC GGACGGATCG
CTCGCTTTCT CGATCGTCGG CTCCAGCCCG GAGGCGCTGG TCACGGTCAG CGATGGCACC
GCGACCACCC ACCCGATCGC AGGTACGCGA TGGCGCGGGG CGTCCGCGGA GGAGGACCTG
CTGTTGGAGA AGGACCTGCG CGCCGACGAG AAGGAAAACA GCGAGCACCT CATGCTCGTC
GACCTCGGCC GCAACGACCT CGGACGCGTG TGCACGCCGG GCACCGTCCG CGTCACCGAC
TACCGGCGGA TCGAGCGCTA CAGCCATGTG ATGCACCTGG TCTCGACGGT CTCCGGTGAC
CTCGCCCCCG ACAAGCAAGC GCTCGACGCG GTCACCGCCT GCTTTCCCGC CGGCACGCTC
ACCGGGGCGC CGAAGGTCCG GGCGATGGAG CTGATCGACG AGGCCGAGCT GACGCGGCGC
GGTCTCTACG GCGGCATCGT CGGCTACCTG GATTTCGCCG GTGACGCCGA CACCGCGATC
GCGATACGCA CCGCCGTGCT CAAGGACGGC ACCGCCTTCG TCCAGGCCGG AGGCGGCGTG
GTCGCGGATT CGGTCGGTGA ATACGAGTAC AACGAATCCC GGAACAAGGC GCTCGCCGCG
CTCAAGGCGG TGGCGGCGGC CAATACGCTG CGCGCGGTCA CCGAGACGGA GGGGGAGGGC
CGATGA
 
Protein sequence
MSTQSASNLQ PDGATTSLDE FLELAERHRV VPVTRTVLAD AETPLSAYRK LAGGRPGTFL 
LESAAAGQSW SRWSFVGAPA PAALTVVDGE AAWIGTKPAG APSGGDPIDA LGEVLRLLSS
EPIEGLPPLN GGMVGYLAYD AVRRLERLPE TTVDDLGLPE MVWLLATDLA AIDHHEGTIT
LIANAVNWDG ARERAAEAYA DAVTRLDAME TALAAPLASS VASFTRPEPA YSAQRTVEEY
SAIVEKLVGD IEAGEAFQVV PSQRFSVPSH ADPIDVYRVL RASNPSPYMY LVQVPAPDGS
LAFSIVGSSP EALVTVSDGT ATTHPIAGTR WRGASAEEDL LLEKDLRADE KENSEHLMLV
DLGRNDLGRV CTPGTVRVTD YRRIERYSHV MHLVSTVSGD LAPDKQALDA VTACFPAGTL
TGAPKVRAME LIDEAELTRR GLYGGIVGYL DFAGDADTAI AIRTAVLKDG TAFVQAGGGV
VADSVGEYEY NESRNKALAA LKAVAAANTL RAVTETEGEG R