Gene Caci_5714 details

Gene Information

Locus tag: Caci_5714
Symbol:
ID: 8337075
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 6594638
End bp: 6596188
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 72%
IMG OID: 644958818
Product: anthranilate synthase component I
Protein accession: YP_003116413
Protein GI: 256394849
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
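
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function and constant names are illustrative and not part of the record.

```python
# Coordinates and lengths copied from the record above.
START_BP = 6594638
END_BP = 6596188
REPORTED_GENE_LENGTH = 1551    # bp
REPORTED_PROTEIN_LENGTH = 516  # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1551 516
```

Both values agree with the reported Gene Length and Protein Length under these assumptions.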


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGCCT TCCCCCCAGG GGTGGACACC CCGGATCTGC CGACCTTCCG CGGGCTCGCC 
GAGGACCGCC GGGTGATCCC GGTGGTGCGG CGCGTGCTGG CCGACGGCGA GACGCCGGTC
GGGCTGTACC GCAAGCTCGC CGGCGAGCGG CCCGGCACCT TCCTGCTGGA GTCGGCCGAG
CACGGCATCT GGTCGCGGTA CTCCTTCGTC GGCGTCAGCA CCGGCGCGGC GCTGAGTGAG
CAGGACGGCG CGGCGCACTG GATCGGCGAG CCGCCGGTCG GGCTGCCGAC CTCCGGCGAC
CCCATGGACG TGCTGCGCGA GACCCTGGAG ATGCTGCACA CCCCGCGGCT GCCCGGGCTG
CCGCCGCTGA CCGGCGGGCT GGTCGGATAC ATGGGCTACG ACGCCGTGCG CCGCCTGGAG
CGGCTGCCGG ACCTGGCCAA GGACGACCTG CAGATCCCGG AGATGACGTT CCTGCTCAGC
CTGGACCTGG CGGTGCTGGA CCACGCCGAC GGCTCGGTCT GGCTGATCGC GAACGTCGTC
AATTATGACA ATCTGCCTAC CGGCGTCGAG GCGGCGTACA CCCGCGCCGT GGAGCGCCTG
GACGCGATGA CCGCCGCGCT GAACGCGCCG ATGGTCAACA CGCCGGTGGT GTACGACCCG
GGCGTCGCGC CGGAGTTCAG CGCCAACCGC ACCTCGGAGG ACTACCGCGA GACGGTGGTG
CGCTGCATCG AGGAGATCAA GTCCGGCGAG GCGTTCCAGA TCGTGGTGAG CCAGCGCTTC
GAGGCCGAGG TGCGGGCCTC GTCGCTGGAC GTCTACCGGG TGCTGCGGGC GACGAATCCG
AGTCCGTACA TGTACCTGCT GCGCGTCCCC GGCCCGGACG GCAGCGCCGA CGGCGGCTTC
GACATCGTCG GCTCCTCGCC CGAGGCGCTG GTGAAGGTGA CCGAGGGCCG CGCCATGCTG
CACCCGATCG CCGGCACCCG CCCCCGCGGC GCCGACCCCG AGCAGGACGC GCAACTCGCC
GCGGACCTGC TGGCCGACCC CAAGGAGCGC GCCGAGCACC TGATGCTCGT CGACCTCGGC
CGCAACGACC TGGGCCGCAT CAGCCGCCCG GGCTCGGTGG AGGTCGTGGA CTTCATGGCG
GTCGAGCGCT ACAGCCACGT CATGCACATC GTCTCCACGG TGATCGGCGA CCTGGCCGAA
GGCAAGACGG CCTTCGACGC CGTCACCGCG ACCTTCCCCG CCGGCACCTT GTCCGGCGCC
CCCAAACCGC GCGCGATGGA GATCATCGAG CACAACGAAC CGACCCGCCG CGGCCTGTAC
GGAGGCATCG TCGGCTACCT GGACTTCGCC GGCGACGCCG ACACCGCGAT CGCCATCCGC
ACGGTCCTGA TCCGCGACGG CATGGCCTTC GTCCAGGCCG GCGCCGGCAT CGTCGCCGAC
TCCGACCCCG CGGCCGAGGA CCAGGAATGC CGCAACAAGG CGATGGCGGT CCTGAAGGCG
GTCGCCGTCG CCGGGACGAT GCGGTCCGCG GTGGGGGAGG GGACGCGGTG A
 
Protein sequence
MTAFPPGVDT PDLPTFRGLA EDRRVIPVVR RVLADGETPV GLYRKLAGER PGTFLLESAE 
HGIWSRYSFV GVSTGAALSE QDGAAHWIGE PPVGLPTSGD PMDVLRETLE MLHTPRLPGL
PPLTGGLVGY MGYDAVRRLE RLPDLAKDDL QIPEMTFLLS LDLAVLDHAD GSVWLIANVV
NYDNLPTGVE AAYTRAVERL DAMTAALNAP MVNTPVVYDP GVAPEFSANR TSEDYRETVV
RCIEEIKSGE AFQIVVSQRF EAEVRASSLD VYRVLRATNP SPYMYLLRVP GPDGSADGGF
DIVGSSPEAL VKVTEGRAML HPIAGTRPRG ADPEQDAQLA ADLLADPKER AEHLMLVDLG
RNDLGRISRP GSVEVVDFMA VERYSHVMHI VSTVIGDLAE GKTAFDAVTA TFPAGTLSGA
PKPRAMEIIE HNEPTRRGLY GGIVGYLDFA GDADTAIAIR TVLIRDGMAF VQAGAGIVAD
SDPAAEDQEC RNKAMAVLKA VAVAGTMRSA VGEGTR
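
The gene and protein sequences above are printed in blocks of 10 characters. As a minimal sketch of how the two relate, the code below strips the block spacing, computes a GC fraction, and translates DNA to protein. It uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in which codons may act as starts). The helper names `clean`, `gc_fraction`, and `translate` are illustrative, not part of any database tooling.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence shown above.
first_line = "ATGACGGCCT TCCCCCCAGG GGTGGACACC CCGGATCTGC CGACCTTCCG CGGGCTCGCC"
print(translate(first_line))  # prints: MTAFPPGVDTPDLPTFRGLA
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the reported GC content.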