Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3510 |
Symbol | |
ID | 8334863 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 3914376 |
End bp | 3915701 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644956654 |
Product | hypothetical protein |
Protein accession | YP_003114257 |
Protein GI | 256392693 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3483] Tryptophan 2,3-dioxygenase (vermilion) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0468884 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGATGG ACGAGCCGAG CCTGGAGCAG GATCTGGCGG CGTGGTCAGC GGACCCGGTC CCGGCGCACT TCCCGTACCT GCCGGTGGTC GGCGCCTTCC ACAGAGCCGG CAAGCACTTC GTGGCCGCGA GCGTGCTCAA GCACCTCGAC GTCGCCCGCG CCACCCTGTC GCAGAGCCCG TATCCGGACC CGGCGCTCGC GCGCTTCCTG GACGTCGTGC TCGACAAGTT CGACGAGCGC TATGACTACC AGACCTACCT GGCCCTGAGC CTGATCCCGA TGCCCGAGAG CGGCGCGTCA GCCCTCGAAG ACGCCACCAG CGGCGTCGCG CAACGGCAGC ACGACCGGCT GCTGGTCCAG CTCGGCGCCG ACGCGCTCGC CTTCGAACTG GCCGCTCTGG ACGGACGCAC CGACCTGTTC CCGCACCTGC GTCCCGATCC GCCGGTCGCA GCCAAGCGGT GCCGGCTCGG GCTGCGCTCC CTGGGACCCG CTCTGGAGCG GCTGGGCCTG GCCGAGGGCC TGGAACCGGC GCTGCGTCCA GGTTCGGAGG GCGACCCGCT CACCGCGGCG CGGCAGATCT GCGCGCGGGT CAGAGCCGAC GCCTCGCCCG AGGAGCGGCG CGTCACCGAT CTGTCGATCC TGCCGGTATG GACCTCCCAC GACGAGTACC TGTTCCTCAG GGTCCTGCAA ACCTTCGAGA CCCGCTTCGC GCTGCTCGCC GTACGGTTGC AGGCCGCCCT GAACGCCCTG GCGATCGGAC GCCCGCGCCT GGCCGTCGCA GAGGTCGGCA ACGCCCAGGC GGGACTGGAG GAATCCTTCC GGCTCTTCTC GCTCCTGGCG ACCATGCAGA TCGAGTCGTT CCAAGAGTTC CGACAGTACA CCGAGGGCGC CAGCGCCATC CAGTCGCGCA ACTACAAGCT CGTCGAATCG CTGTGCCGCG TCCCGGACGG AGACCGCCTG GACTCCCCCG CGTACCGTTC GGTGCCCGAG CTGCGCGAAC GGGTCCTGCA AGACCCGCCG AACCTGGACG ACGCGGTCTG GCTCGGCAGC CAGACCGGCG CGCTCTCGAG CACCGAGCGG CGCGAAATGG CCGGCGCGCT GCAAGGCTTC GCCGCGCAGC TGCTGCAATG GCGCCAGACG CACTACCGCC TGGCGGTACG GATGCTCGGC GACCGGCCGG GCACCGGCTA CACCGAGGGC ACGCCCTACC TGAGGGAAGT GCGCACCATC CCGGTGTTCG CCAAGAGCAC TCCGGATATC AGCGTACGTA CATCAGACCC TTCTGCAGGA CGAATGCCAC CGCGACCGTC CGGTTCGGGC AGTTGA
|
Protein sequence | MSMDEPSLEQ DLAAWSADPV PAHFPYLPVV GAFHRAGKHF VAASVLKHLD VARATLSQSP YPDPALARFL DVVLDKFDER YDYQTYLALS LIPMPESGAS ALEDATSGVA QRQHDRLLVQ LGADALAFEL AALDGRTDLF PHLRPDPPVA AKRCRLGLRS LGPALERLGL AEGLEPALRP GSEGDPLTAA RQICARVRAD ASPEERRVTD LSILPVWTSH DEYLFLRVLQ TFETRFALLA VRLQAALNAL AIGRPRLAVA EVGNAQAGLE ESFRLFSLLA TMQIESFQEF RQYTEGASAI QSRNYKLVES LCRVPDGDRL DSPAYRSVPE LRERVLQDPP NLDDAVWLGS QTGALSSTER REMAGALQGF AAQLLQWRQT HYRLAVRMLG DRPGTGYTEG TPYLREVRTI PVFAKSTPDI SVRTSDPSAG RMPPRPSGSG S
|
| |