Gene Caci_2124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_2124 
Symbol 
ID8333469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2409269 
End bp2410471 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content66% 
IMG OID644955274 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003112884 
Protein GI256391320 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID[TIGR03227] 2-aminoethylphosphonate ABC transporter, periplasmic 2-aminoethylphosphonate binding protein 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCACG TACTGTCCCG CCCGCGTACC CGCCGTGTCG GCGCGCTGAT CGCCGTCGGC 
GGCCTGACGC TGACCGGACT GACGGCTTGT GCCTCTTCCA AGAGCTCCTC CGGTGCCGCC
GCCGGCAGCA CCGCCACCTC CGCCGCTGCT GCGGCCGCGT CCTCCTCCTG CCCGGCGCTC
GGTGCCCCAG CGTCGACCGC GCCTGCGAAG GCCGACGGCT CCGGCGGGCA GGTGACCATC
TACAGCGCCG ACGGTCTGTA CGACGCGAAG GATGACAAGA ACTGGTACAA CCAGGAATTC
AAAAAGTTCA CCGCCCTGAC CGGCATCCAC GTGAACTACT CCGAGGACGG CTCCGGCGGC
GTGGAGACCA AGGTCGACTC CGAGAAGTCG AACCCGAAGG CCGACGTCAT CGTGACCCTG
CCGCCGTTCA TCCAGAAGGC TGAAGCCTCC GGGCTGCTGC AGGCGTACAG CCCGGCGTGT
GTGGACAAGG TCGACCCCTC GCTCGTGGAC AAGAACGGCG AGTGGGAAGC GGTGATGGGC
AACTACCTGT CCTTCATCTA CAACACCAAG GCGCTGCCCG ACGGCCCGCC GAAGACCTGG
AACGACCTGC TGGACCCGAA GTTCAGCAAG AAGTTGCAGT ACTCGACGCC GGGTGTGGCC
GGTGACGGGA CGGCGGTGAT GATCGCGGCG ATCCACGCCT TCGGCGACAA CCGCGACTCC
GCCTGGAGCT TCTTCAAGCA GCTGCAGTCG AACAACGTCG GGCCGTCGAA GTCCACCGGC
GCGCTGGAGA GCAAGGTCAA CACCGGCGAC CTGCTGGTGG CGAACGGCGA CGTGCAGATG
AACTACGTCG ACAGCACGAC GCAGTACCCG AACAACAAGA TCTTCTTCCC GGCGGGCAAC
GACGGCAAGC CAAGCACGTT CTCGCTTCCG TATATGGCGG GCTTGGTCAA GGGCGCGCCC
CATGCCGACA ACGGCAAGAA GCTGATCGAC TTCCTGCTGT CCGAGGGCGC GCAGCTGGAC
GCTTCCAAGG TGGCGTATGG CTTCCCGGCG CGTACCGACG TCAAGCCCAC GGACAGCAAC
TACGCGGCTT TGAACGCGCT GCTTCAGGGC GTGACCGTCT TCCCGGTGGA CTGGAACGAG
GTCGCGCAGA ACTACAACAG CGACGTCAAG GCGTGGGACA CCGCCACCGG CACGCCGAGC
TGA
 
Protein sequence
MSHVLSRPRT RRVGALIAVG GLTLTGLTAC ASSKSSSGAA AGSTATSAAA AAASSSCPAL 
GAPASTAPAK ADGSGGQVTI YSADGLYDAK DDKNWYNQEF KKFTALTGIH VNYSEDGSGG
VETKVDSEKS NPKADVIVTL PPFIQKAEAS GLLQAYSPAC VDKVDPSLVD KNGEWEAVMG
NYLSFIYNTK ALPDGPPKTW NDLLDPKFSK KLQYSTPGVA GDGTAVMIAA IHAFGDNRDS
AWSFFKQLQS NNVGPSKSTG ALESKVNTGD LLVANGDVQM NYVDSTTQYP NNKIFFPAGN
DGKPSTFSLP YMAGLVKGAP HADNGKKLID FLLSEGAQLD ASKVAYGFPA RTDVKPTDSN
YAALNALLQG VTVFPVDWNE VAQNYNSDVK AWDTATGTPS