Gene Caci_2219 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_2219 
Symbol 
ID8333565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2518714 
End bp2519760 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content71% 
IMG OID644955370 
ProductChorismate binding-like protein 
Protein accessionYP_003112979 
Protein GI256391415 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.294464 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.00136652 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCCCTCCC CTGCTCGCCG CCCGCCTCCG ATATCCGACC CCGAGCCGCT GGCCCACTTC 
GCCGGCCGAC TCGCGACCGG GCTGCGCGAC GTTACCTCGG ACCTGAGCGC CCTGGATTCC
TCCGGCTTCT GGGCCGTCGT GGGCACCTAC GAAGGCGATT GGACCCTGGC GCGCTTCGAC
GACGTCCGGG ACGCCCCGCT CCCCTCGCCA AGCACGCCCT GGATCGGTCC GGACCGCGAC
GCGTGGATCT CCTCCATGGA TCAGGGCGCA TACGTGGGAG CTGTCAAAGC CATCCGCGAC
GAGATCGCCG CCGGCGAGGT CTACCAGGTG AACCTCTGCC GCGTCCTGTC CGCGCCGATC
GCTTCGCAGG CCGAACCCCT CGCCCTCGCC GCGCGCCTAC GCACGGGGAA CCCCGCTCCG
TACGCCGGTC TGGTCAACGT CCCCGGCACC CGCGTGGTGA CCGCCTCCCC CGAGCTCTTC
CTGCGCCGCG ACGGCCGCAC CGTCACCTCC GAGCCCATCA AGGGCACCGC GCGGACCGAG
GAGGAATTTC TCCCGAAGGA CACCGCCGAG AACATCATGA TCGTCGACCT GGTCCGCAAC
GACCTGGCGC GGGTCGCCGA GATCGGCTCC GTCGAGGTCC CCGCGCTGCT GCGTGTCGAG
CCGCATCCCG GGCTCGTGCA CCTGGTCTCG ACCGTGACCG CCGAGCTGAC CGCCGATGTG
GGCTGGCTAG AGCTGGTCGC GGCGACCTTC CCTGCGGGCT CCATCACCGG GGCGCCGAAG
AGCAGCGCGT TGCGCATCAT CGACGAGCTG GAGAACGCGC CGCGCGGTCC GTATTGCGGT
GCCGTCGGGT GGGTGGACGC CGATCGTGGT GTCGGCGAGC TGGCGGTGGG CATCCGTACG
TTCTGGTGGC AGGACGACCG CCTGTGTTTC GGCACCGGTG CCGGCATCAC GTGGGGGTCG
GATCCGCAGG GAGAATGGGA CGAGACCGAG CTCAAGGCCG CGCGGCTGCT GGCGGTCGCG
TCGGGACCGC GGCCGGCCGC GCAGTGA
 
Protein sequence
MPSPARRPPP ISDPEPLAHF AGRLATGLRD VTSDLSALDS SGFWAVVGTY EGDWTLARFD 
DVRDAPLPSP STPWIGPDRD AWISSMDQGA YVGAVKAIRD EIAAGEVYQV NLCRVLSAPI
ASQAEPLALA ARLRTGNPAP YAGLVNVPGT RVVTASPELF LRRDGRTVTS EPIKGTARTE
EEFLPKDTAE NIMIVDLVRN DLARVAEIGS VEVPALLRVE PHPGLVHLVS TVTAELTADV
GWLELVAATF PAGSITGAPK SSALRIIDEL ENAPRGPYCG AVGWVDADRG VGELAVGIRT
FWWQDDRLCF GTGAGITWGS DPQGEWDETE LKAARLLAVA SGPRPAAQ