Gene Caci_3667 details


Gene Information

Locus tag: Caci_3667
Symbol:
ID: 8335020
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4106330
End bp: 4107823
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 71%
IMG OID: 644956807
Product: Anthranilate synthase
Protein accession: YP_003114410
Protein GI: 256392846
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID:
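The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4106330
end_bp = 4107823
gene_length_bp = 1494
protein_length_aa = 497

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# A CDS is made of whole codons; the last one is assumed to be the stop codon.
codons, remainder = divmod(span_bp, 3)
coded_residues = codons - 1

print(span_bp, codons, remainder, coded_residues)  # 1494 498 0 497
```

Under these assumptions the span matches the reported gene length, and 498 codons minus one stop codon gives the reported 497 residues.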


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.444174
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.983717
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTCTCGC CATCGATCCC GGTCCGGGTG ACCGTTACCG AATTGCCAGG CGGCAATCCC 
CTTCAAACCT ATGAAGAACT GCTGCGCGAT CGGCCTGGCC ACGACGTGTT CCTCTTCGAG
AGTCCGGCCG GTCCGGTCCA GGACCGGCGC TGGGCGGCGG TCGGCTGGGA CCGGCTCGCC
GAGATCCGGC TGTACGCCGG ACGCGTCGAG CTGACCGCCG GGGCCGCTCT CGGAGATCTG
CTCGCCGAGA CCGTCTCGGC CGAGGCCGGG CCGCCCAAAG AAGCCTATGA CGGCATCCGG
GTCTGGGACG CGTCCGCCCC GGAGACCAGA TGGCGCCTGC TGGCGGCGAT CACCGAGGCC
TTCGACGTCG ACACCGACCT GGGCCAGGAC ACCTTCGCCT TCGGATTCCT GACCGTCCTG
TCCTACGAGG CCGCCTGGGA CATCGAGGAC ATCCCGCAGA CCCAGCAAGA CGCTGACATG
CCCCGGTGCA CCCTGGCCCT GTTTCGGAAC ACGGTCTGGT ACGACCAGCA CAGCGGCGGG
GTGCGCCTGC TGTCCGCCTC CGGCCCGCAC TTCCCCGCGG CCGGGAGTGA GCAAGCTCTC
ACGCCGGCCG CCGCGAGCGG GGAATCAGAG CCCTCCATCA TCCCCGCCGC CCCGGCGCCG
CGTTCGGTGA CCGACAACGT CGAGCAGGAA ACGTTCACCG GCCGCGTCGA GCGCTGCCTG
CGCCACATCG GCGTCGGCGA CAGCTACCAG ATCCAGATCG GGCACCGGAT CGACGTCGAG
ACCGACCTGA CCCCGGTCGA GGTCTACAAA AGGCTGCGCC ACCGGAATCC GTCCCCTTAC
ATGTACCTGA CGCCCTGGAG CGGCCGGACC GTGATCGGCG CCAGCCCGGA GCTGTTCTTC
CGGATCGAGG GCCGCCGCAT CCTCATGCGC CCCATCGCCG GGACCGCGCC GCGGGCCGCC
GATCCGGCCG AGGACGAGCG CCGGGTCGCC GAGCTGCGCG CCAGCGCCAA GGAACAGGCC
GAGCACGTCA TGCTCGTCGA CCTGTGCCGC AACGACATCG GCCGGGTGTG CGTGCCCGGC
ACCCTCCAGG TCGAGACCAT GATGGAGGTC GAGCCCTTCG CCTACGTCCA CCACCTGGTC
TCCACGGTCT CCGGAGACCT CGAGGACGGC GTCGGGGTGT GGGAGGCCGT CCGCGCGACG
TTCCCGGCCG GCACCATGAC CGGCGCGCCG AAGGTGCGCG CCATGGAGAT CATCAACGAG
ATCGAGGACG ACCCGCGCGG CGCCTACGCC GGCGCGCTCG GCCTGATCGA CGTGCGCGGC
TTCGCGGTGC TCGCCTTGTG CATCCGCACC ACCGTCCACG ACGGCCGCGC CTACAGCACC
CAGGCCTCCG CCGGCGTGGT CGCCGACTCC GTCCCGGCCT CCGAATGGCG CGAGACGCTG
GCCAAGATGA GCGCCACCTA CTGGGCCCTG ACCGGCGAGG AGCTCCTGTC GTGA
 
Protein sequence
MLSPSIPVRV TVTELPGGNP LQTYEELLRD RPGHDVFLFE SPAGPVQDRR WAAVGWDRLA 
EIRLYAGRVE LTAGAALGDL LAETVSAEAG PPKEAYDGIR VWDASAPETR WRLLAAITEA
FDVDTDLGQD TFAFGFLTVL SYEAAWDIED IPQTQQDADM PRCTLALFRN TVWYDQHSGG
VRLLSASGPH FPAAGSEQAL TPAAASGESE PSIIPAAPAP RSVTDNVEQE TFTGRVERCL
RHIGVGDSYQ IQIGHRIDVE TDLTPVEVYK RLRHRNPSPY MYLTPWSGRT VIGASPELFF
RIEGRRILMR PIAGTAPRAA DPAEDERRVA ELRASAKEQA EHVMLVDLCR NDIGRVCVPG
TLQVETMMEV EPFAYVHHLV STVSGDLEDG VGVWEAVRAT FPAGTMTGAP KVRAMEIINE
IEDDPRGAYA GALGLIDVRG FAVLALCIRT TVHDGRAYST QASAGVVADS VPASEWRETL
AKMSATYWAL TGEELLS
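The gene begins with GTG, yet the protein begins with M. This is expected for a bacterial CDS, where an alternative initiation codon is still read as methionine. The sketch below shows one way the relationship between the two sequences, and the GC content figure, could be reproduced. It is not the method used to build this record. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the set of start codons is an assumption based on NCBI translation table 11, whose amino acid assignments are the same as in the standard code.

```python
# Codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            # The initiator codon is read as methionine whatever its usual meaning.
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed in the record.
first_block = "GTGCTCTCGC CATCGATCCC GGTCCGGGTG"
print(translate(first_block))  # MLSPSIPVRV
```

Passing the complete gene sequence to `translate` and `gc_content` would allow a comparison with the protein sequence and the GC content listed in the record.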