Gene Caci_3510 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3510 
Symbol 
ID8334863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp3914376 
End bp3915701 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content70% 
IMG OID644956654 
Producthypothetical protein 
Protein accessionYP_003114257 
Protein GI256392693 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3483] Tryptophan 2,3-dioxygenase (vermilion) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0468884 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGATGG ACGAGCCGAG CCTGGAGCAG GATCTGGCGG CGTGGTCAGC GGACCCGGTC 
CCGGCGCACT TCCCGTACCT GCCGGTGGTC GGCGCCTTCC ACAGAGCCGG CAAGCACTTC
GTGGCCGCGA GCGTGCTCAA GCACCTCGAC GTCGCCCGCG CCACCCTGTC GCAGAGCCCG
TATCCGGACC CGGCGCTCGC GCGCTTCCTG GACGTCGTGC TCGACAAGTT CGACGAGCGC
TATGACTACC AGACCTACCT GGCCCTGAGC CTGATCCCGA TGCCCGAGAG CGGCGCGTCA
GCCCTCGAAG ACGCCACCAG CGGCGTCGCG CAACGGCAGC ACGACCGGCT GCTGGTCCAG
CTCGGCGCCG ACGCGCTCGC CTTCGAACTG GCCGCTCTGG ACGGACGCAC CGACCTGTTC
CCGCACCTGC GTCCCGATCC GCCGGTCGCA GCCAAGCGGT GCCGGCTCGG GCTGCGCTCC
CTGGGACCCG CTCTGGAGCG GCTGGGCCTG GCCGAGGGCC TGGAACCGGC GCTGCGTCCA
GGTTCGGAGG GCGACCCGCT CACCGCGGCG CGGCAGATCT GCGCGCGGGT CAGAGCCGAC
GCCTCGCCCG AGGAGCGGCG CGTCACCGAT CTGTCGATCC TGCCGGTATG GACCTCCCAC
GACGAGTACC TGTTCCTCAG GGTCCTGCAA ACCTTCGAGA CCCGCTTCGC GCTGCTCGCC
GTACGGTTGC AGGCCGCCCT GAACGCCCTG GCGATCGGAC GCCCGCGCCT GGCCGTCGCA
GAGGTCGGCA ACGCCCAGGC GGGACTGGAG GAATCCTTCC GGCTCTTCTC GCTCCTGGCG
ACCATGCAGA TCGAGTCGTT CCAAGAGTTC CGACAGTACA CCGAGGGCGC CAGCGCCATC
CAGTCGCGCA ACTACAAGCT CGTCGAATCG CTGTGCCGCG TCCCGGACGG AGACCGCCTG
GACTCCCCCG CGTACCGTTC GGTGCCCGAG CTGCGCGAAC GGGTCCTGCA AGACCCGCCG
AACCTGGACG ACGCGGTCTG GCTCGGCAGC CAGACCGGCG CGCTCTCGAG CACCGAGCGG
CGCGAAATGG CCGGCGCGCT GCAAGGCTTC GCCGCGCAGC TGCTGCAATG GCGCCAGACG
CACTACCGCC TGGCGGTACG GATGCTCGGC GACCGGCCGG GCACCGGCTA CACCGAGGGC
ACGCCCTACC TGAGGGAAGT GCGCACCATC CCGGTGTTCG CCAAGAGCAC TCCGGATATC
AGCGTACGTA CATCAGACCC TTCTGCAGGA CGAATGCCAC CGCGACCGTC CGGTTCGGGC
AGTTGA
 
Protein sequence
MSMDEPSLEQ DLAAWSADPV PAHFPYLPVV GAFHRAGKHF VAASVLKHLD VARATLSQSP 
YPDPALARFL DVVLDKFDER YDYQTYLALS LIPMPESGAS ALEDATSGVA QRQHDRLLVQ
LGADALAFEL AALDGRTDLF PHLRPDPPVA AKRCRLGLRS LGPALERLGL AEGLEPALRP
GSEGDPLTAA RQICARVRAD ASPEERRVTD LSILPVWTSH DEYLFLRVLQ TFETRFALLA
VRLQAALNAL AIGRPRLAVA EVGNAQAGLE ESFRLFSLLA TMQIESFQEF RQYTEGASAI
QSRNYKLVES LCRVPDGDRL DSPAYRSVPE LRERVLQDPP NLDDAVWLGS QTGALSSTER
REMAGALQGF AAQLLQWRQT HYRLAVRMLG DRPGTGYTEG TPYLREVRTI PVFAKSTPDI
SVRTSDPSAG RMPPRPSGSG S