Gene Tpau_3944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_3944 
Symbol 
ID9158125 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp4064180 
End bp4065625 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content69% 
IMG OID 
ProductCarotenoid oxygenase 
Protein accessionYP_003648855 
Protein GI296141612 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACGC ACAAGCAGAT CGGCGTCGAT CGGTCCCGCG TCTTCGAGCC GGCGGCGGGT 
GAGTTCGACG ACGTGCTGAT CGAGGACATC ACCGGCGCGC TCCCCGAGGA TCTGCTCGGG
ACGCTGTACC GCAGCGCACC GGTGCGCTGG GAAGCCGGCG GCTTCTACGC CAAGCACCTG
TTCGACGGTG ACGGGATGGT CGCGAAGTTC TCCCTGGAGG GCGGCCGGCT GCGGTACCGG
AACCGGTACG TCCGCACCCC CAAGTACCAG CGGGAGGAGG CCGGGCGAGG GGACCGAGTA
CGTGGGCTCG GCACCCTCGC CGCGGGGGGT CCACTCGGTA ACGCGGGTCG CGGCCCCGCT
GATCGCGCCA ACACCACGTC GATCTACCAC GGTGACCGGC TGGTCGCGCT CTCCGACGAC
GGCCGGCCGT GGGAGCTCGA TCCCGACTCG CTGGCGACGC GCGGTCGGTG CGATTTCGAC
GGCGCGCTGA GCCTGTTCAG CACGTTCTCG CCACACCCGA AGCTCGATCC GGTCTCGGGC
GAGATGTTCA ACTTCGGCCT CACGCTGCCG CCGGTGGGAC GCAGCACCGG CATGGTGGGC
CTGCGCTGCT ACCGGGTCGA TCCGCGCGGT CGGCTGAGCA CGATCCGCAC AGTGCCGCTC
TCGCACGTGC TGATCAATCA CGACTTCGCG ATCACCGAGC GGTACCTGGT GTTCGTCCTG
GACCCGCTGA CCGTGGGGAC GGTCGCGACG GCGCGAGCCG GCCTGGGCAT GATCGACTTC
AACGCGGCCA CCGAGTTCCG CGCGGAGCTC GGGTCCGAAG TGATCCTGGT GCCACGGGAC
GGCGGTGAGC CGCGCCGGTT CACCATCCCC GCGTTCCCGA AGGTGCACGT GAACAACGCC
TACGAGGTGG GCGGCGACGT GCTGATCGAT GTGGTCAAGT ACGACGACTG GAACGAGACC
GCGCGGATGC TCTGCGACTT CCGCGAATAC GGCCCTGCTC GGGGCGGCAC GCTCACGCGG
TTGCGGATCA CGCCGAGCGA TCGGGTGGAA CTCACCGAGC TATCGGCCTC GCTGGGCGAA
TTCCCGATGC ACGACTGGCG CCGCACCGGA CGACGATTCC GCTACTCGTA TCTGATGGAG
TCCGACGGGG TGCGTCCGCC GTCGCTGGAG AAGATCGACA ACGAGACCGG CGAGCAGACC
TCGATGGGCG GGTTCGCCTT GTTCGACGGT GTCGGTGAGC CGATCTTCGT GCCGCGCAGT
CCCGATGCCG CCGAGGACGA CGGCTGGCTT CTGGTGGTGG TGTATCTGGC GTCGGACCAC
CGCTCGGCGC TGATCGTCGT GGACGCTCGG GACATCGAGG CGGGGCCGGT GGCGGTAGCA
CGGCTGCCGC ACCACTTCTT CCCCGGATTC CACGGCATGT TCACCGACCG GGTCTCCCTC
ACCTAG
 
Protein sequence
MTTHKQIGVD RSRVFEPAAG EFDDVLIEDI TGALPEDLLG TLYRSAPVRW EAGGFYAKHL 
FDGDGMVAKF SLEGGRLRYR NRYVRTPKYQ REEAGRGDRV RGLGTLAAGG PLGNAGRGPA
DRANTTSIYH GDRLVALSDD GRPWELDPDS LATRGRCDFD GALSLFSTFS PHPKLDPVSG
EMFNFGLTLP PVGRSTGMVG LRCYRVDPRG RLSTIRTVPL SHVLINHDFA ITERYLVFVL
DPLTVGTVAT ARAGLGMIDF NAATEFRAEL GSEVILVPRD GGEPRRFTIP AFPKVHVNNA
YEVGGDVLID VVKYDDWNET ARMLCDFREY GPARGGTLTR LRITPSDRVE LTELSASLGE
FPMHDWRRTG RRFRYSYLME SDGVRPPSLE KIDNETGEQT SMGGFALFDG VGEPIFVPRS
PDAAEDDGWL LVVVYLASDH RSALIVVDAR DIEAGPVAVA RLPHHFFPGF HGMFTDRVSL
T