Gene Caci_3347 details

Gene Information

Locus tag: Caci_3347
Symbol:
ID: 8334700
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 3697199
End bp: 3698599
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 73%
IMG OID: 644956492
Product: amino acid/amide ABC transporter substrate-binding protein, HAAT family
Protein accession: YP_003114095
Protein GI: 256392531
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
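
The coordinates and lengths listed above can be cross-checked against each other. The sketch below is only an illustration: it assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length, neither of which the record states explicitly. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 3697199
end_bp = 3698599
gene_length_bp = 1401
protein_length_aa = 466

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not code for a residue.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates match gene length
print(remainder == 0)                                # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # codons minus stop match protein length
```

Under these assumptions all three checks print True, which agrees with the gene sequence ending in the stop codon TGA.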


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.716666
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.862215
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCTGA GCAGGCGCCG CCGGCGCGTC GTCCCGCTCC TGATCGCGCC CCTGGCGCTG 
GCGGCGCTCG CGGCGGCGGC CACCGCCTGC GGCACTCGCC TGCCCGCCAG CGCCTTCGCC
CCCCGCCCCG GGCCGACGGC GCCCGGCAGC ACGCAGCCGC CGGCGACAGC GCCGGCGAAC
CAGAACCCCG CCAGCGACGT CGGCGTCACC CCGACCCAGA TCCGCGTCGG CATCCTCGCC
TCGCTGACCA GCCCCGTCGG CTCCGCCGCC TTCAGCGGTC CCAGCTACGG CGCGCAGGCC
TTCTTCCGCG CGCTCAACGC CGCCGGCGGC GTCCACGGCC GGACCGTCGC GGTCTCCGTG
TGCGACGACG GCGGCAGCGG TATCGGCAAC CAGGACTGCG TCCACCAGCT CATCGACCAC
GACCAGGTCT TCGCCCTGAC CGCGACCGCG GCGCTGGACT ACGCAGGGGC GGACTACGTC
AGCAAGAAGG CCGTCCCCGA CATCGGCGGC CAGCCCATCA CCACCGTCTA CGACCAGTAC
CCGCACCTGT ACGCGATCGG CGGCAGCAGC TCGCCGCGCG ACGGCCGCAC CGTCGGCTGG
AACGGCACCC TGTATCAGAG CACGGAGATC TTCCGCTTCT TCAAGCAGCG CCTGGGCTCG
GCGCGCGCCG CGGTCGTCGC CTACAACCAG GCCGACTCCA CGCGCTACGC CTCCCAGCTC
GCCGCCGGTC TGCGCGCCGA GGGGTACCAC GTCCTGTCCC AGACCGTCGA CCTGGCGCTG
CCGGGCTTCC AGGCGGTCGC GGCGGCGATG AAGGCCGACG GCAGCCAGCT GTTGTTCGAC
GCCATGGACA CCCGGGGCAA CGCAGCACTC TGCAACGCCA TGGACGCCGC CGGTGTCAGG
GTCCTGGCAA AGGTCACCAA CGTCGAGAAC TGGGGCGAGT CGGTCCGCGA GGACTACCGC
TCCTCCCCCG CCTGCCGCAA CGTGCTGTGG GCGACCTCCT CCAGCCGGAA CTACGAGGAC
GTCCAGTACC CGGCGGTCGC GCAGTTCCGC GCGGCGATGG CGCGGTACTT CCCGGACCAG
GCGTCCCAGC TGTCCGCCTG GGACCTGGAG GGCTGGGCCG CGGCGCAGTG GCTCACCGAC
GCCATCGACT CCTGCGGCGC GAACGTGACC CGCGCCTGCG TCGAGGGTTT CATGAACCGC
CCGCAGCCCT ACGACGCGCA CAACCTGATC CTGCCGGCGT CCTTCATCCC GACTCCCCCG
CCGACCGGGA CCACCCGCGC CTGCCTGAAC GCGGCGCGCT GGCAGGACTC GGCGCAGGGC
GGCAGGGGCG GGTGGGTGAC GCAGGTCGCT GACATGGACA CCACCTGCTT CGACGTGCCG
CAGCTGCCGT ACACGCCGTG A
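
The sequence above is laid out in blocks of 10 bases, 60 per line, so the spacing has to be removed before any calculation. The record does not say how its GC content figure was computed; the sketch below assumes the plain fraction of G and C bases over the gene sequence. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return gc / len(sequence)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCAGCTGA GCAGGCGCCG CCGGCGCGTC GTCCCGCTCC TGATCGCGCC CCTGGCGCTG"
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases on the first line
print(f"{gc_fraction(seq):.0%}")   # GC share of this line only, not of the whole gene
```

Passing the full gene sequence text to `clean_sequence` and then to `gc_fraction` gives the value to compare with the GC content field.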
 
Protein sequence
MQLSRRRRRV VPLLIAPLAL AALAAAATAC GTRLPASAFA PRPGPTAPGS TQPPATAPAN 
QNPASDVGVT PTQIRVGILA SLTSPVGSAA FSGPSYGAQA FFRALNAAGG VHGRTVAVSV
CDDGGSGIGN QDCVHQLIDH DQVFALTATA ALDYAGADYV SKKAVPDIGG QPITTVYDQY
PHLYAIGGSS SPRDGRTVGW NGTLYQSTEI FRFFKQRLGS ARAAVVAYNQ ADSTRYASQL
AAGLRAEGYH VLSQTVDLAL PGFQAVAAAM KADGSQLLFD AMDTRGNAAL CNAMDAAGVR
VLAKVTNVEN WGESVREDYR SSPACRNVLW ATSSSRNYED VQYPAVAQFR AAMARYFPDQ
ASQLSAWDLE GWAAAQWLTD AIDSCGANVT RACVEGFMNR PQPYDAHNLI LPASFIPTPP
PTGTTRACLN AARWQDSAQG GRGGWVTQVA DMDTTCFDVP QLPYTP