Gene Caci_1722 details

Gene Information

Locus tag: Caci_1722
Symbol:
ID: 8333065
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 1952625
End bp: 1954055
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 68%
IMG OID: 644954872
Product: extracellular solute-binding protein family 1
Protein accession: YP_003112484
Protein GI: 256390920
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
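
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1952625
end_bp = 1954055
gene_length_bp = 1431
protein_length_aa = 476


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1431
print(expected_protein_length(gene_length_bp))  # 476
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields.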


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.902381
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAGCAC ATTCCGCACG GTCCGGAAAC CTTCGAACCC TGATAGCGGT CGGCGCGGCC 
GTCGCCGCCC TCGTCGCCGG CTGCTCCAGT TCCTCCTCCG GTTCCAAGCC GGCCGCCGTG
AACTCCGCCG ACCAGCCGAA GAACCCCACG CTGGTGATCA CCGCCAACGA CATCGCGGGC
GGGAAGAACA GCAACGAGGC GAACTGGATC CAGAAGACGC TGATCCCGGA CTTCGTGAAA
GCCGAGGCGG CCAAGGGGAT CACCGCCCAC GTCACCTTCC GGCCCAACGG CGTGGACGAC
AACGCCTACA AGTCCAAGCT CGCCCTGGAC CTGCAGTCCG GCACCGGCGA CGACGTCTTC
TCCCTGGACG GCATCTGGGT CGGCGAGTTC GCCGACGCCG GCTATGTCAA GCCGCTGAAT
CAGGTGGCCG GAGCCCAGGT CGACAGCTGG GACGGCTGGT CGCAGATCAC CCAAGCCGTC
CAAGCCCTCG GCGAGTACCA GGGCAAGCGT TTCGGCGTCC CGAACGGCAC CGACGCCCGC
GTCATCTTCT TCAACAAGAA GCTGTTCGCG CAGGCCGGGC TGCCCGCCGA CTGGCAGCCG
ACGAGTTGGC AGGACCTCTA CGACGCGGCC GCCAAACTCA AGACCCTGCC AGGGGTCACC
CCGGTCCAGT GGGACGGCGG CGTCCCGATG GGCGAGGCCA CGACGATGCA GGGCTTCCTG
CCGCTGCTGT CCGGCGCGGG CGGCTCGTTG TGGGCGAACG GCAAGTGGAT GAAGGCCGGC
ACGGCGTTCA CCTCGGCGCT CGGCTTCTAC CAGAAGATCT ACGGCGGCGG TTACGGCGAT
CCGGTGTTGC AGGAGGACGC CAAGGGCCGC GACAAGTCCT TCACCGAGTT CGCGGCGAAC
AAGATCGGCA TCTACGCCGA GTCCGACTAC ATGTGGCGCT CGGTCCTGAA TCCCACCGGC
GGCACCGCGC CGATGGCCGA CCGCGACACC GATGTCGGCT ACGCGCTGAT CCCGTCGCAG
ACCCCGAGCT CGGGTGTGAA GGGCCAGGGC TTCGTGTCCT ACTCGGGCGG TTCGGACTGG
TCCATCAATC CCAAGACCAA GTATCCGCAG GCGGCGTGGG ACTTCCTGGC GTTCCTGAAC
TCCAAGACCG AGACCGAGTC CCGGATCAGC GGCGCGCCGC TGCTCACCGC CCGGACCGAC
GTGAACCAGC AGGTGCTCGG CAACGACCCG ATGCTGAAGT TCGCCACCGA CAAGGTGCTG
CCGATCACCG CGTTCCGGCC GTCGCAGGCG GCGTACAACG ACGTGTCGAG CCTGGTCCAG
AAGGCGGTCG CGGACGTGGT CGGCGGCAAG AGCCCGGAGC AGGCCGCGGC GGCCTACGAG
AAGGCGTTGG AGGGGCTCGT TGGCGCGGAC AGCATCGCCG CGGGCAGCTG A
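
A GC content figure such as the one in this record can be recomputed from the block-formatted sequence. The following is a minimal sketch, not the method used by the source database: `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first 60-base line of the gene sequence is pasted in, so the printed value describes that line rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-base block formatting."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record (60 bases).
first_line = "ATGGGAGCAC ATTCCGCACG GTCCGGAAAC CTTCGAACCC TGATAGCGGT CGGCGCGGCC"
seq = clean_sequence(first_line)

print(len(seq), round(gc_fraction(seq), 2))  # 60 0.65
```

Passing the full gene sequence through the same two functions gives the whole-gene value.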
 
Protein sequence
MGAHSARSGN LRTLIAVGAA VAALVAGCSS SSSGSKPAAV NSADQPKNPT LVITANDIAG 
GKNSNEANWI QKTLIPDFVK AEAAKGITAH VTFRPNGVDD NAYKSKLALD LQSGTGDDVF
SLDGIWVGEF ADAGYVKPLN QVAGAQVDSW DGWSQITQAV QALGEYQGKR FGVPNGTDAR
VIFFNKKLFA QAGLPADWQP TSWQDLYDAA AKLKTLPGVT PVQWDGGVPM GEATTMQGFL
PLLSGAGGSL WANGKWMKAG TAFTSALGFY QKIYGGGYGD PVLQEDAKGR DKSFTEFAAN
KIGIYAESDY MWRSVLNPTG GTAPMADRDT DVGYALIPSQ TPSSGVKGQG FVSYSGGSDW
SINPKTKYPQ AAWDFLAFLN SKTETESRIS GAPLLTARTD VNQQVLGNDP MLKFATDKVL
PITAFRPSQA AYNDVSSLVQ KAVADVVGGK SPEQAAAAYE KALEGLVGAD SIAAGS
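
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the sense strand is given 5' to 3' starting in frame, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard table (the two differ in their permitted start codons). The `translate` function is a hypothetical helper written for this example; only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Standard codon assignments, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence from this record.
gene_head = "ATGGGAGCAC ATTCCGCACG GTCCGGAAAC CTTCGAACCC TGATAGCGGT CGGCGCGGCC"
gene_tail = "AAGGCGTTGG AGGGGCTCGT TGGCGCGGAC AGCATCGCCG CGGGCAGCTG A"

print(translate(gene_head))  # MGAHSARSGNLRTLIAVGAA
print(translate(gene_tail))  # KALEGLVGADSIAAGS
```

The two outputs match the first 20 and last 16 residues of the protein sequence listed above.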