Gene Caci_7721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_7721 
Symbol 
ID8339097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp8959101 
End bp8960489 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content69% 
IMG OID644960805 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003118386 
Protein GI256396822 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCGCA AGCTCATAGC CCTGTCGTCG GCGGCCGTGG CCGTGGCGCT CATGGCTTCC 
GCGTGCAGCT CGTCGAAGTC CAGCGGGTCC GACGCGGCCG GCAGCGGCGG CACCACGGCC
GCTCCCTCGT CGAACCAGCT CGGCACCGGC GGCACCGGGT CCACCGGTTC GGCGATCACG
ACGGACGGCA AGGGCAAGAC GGTCAACATC TGGCTGATGC AGGACGCGCA GAAGGGCTGG
CAGAACGTCG TCGACCAGGC CAACCAGCGC TTCACCGCGG AGACCGGCGC GCAGGTCAAG
ATCAACTGGC AGACCTGGAC CAACTACGGC CAGACCGTGG ACGCCGCCAT CGGCTCCTCC
TCGGCTCCGG ACGCCATCGA GCTGGGCAAC ACCCAGACCG CCAAGTACAT CGGCGCCGAC
CAGCTGGTCG ACCTCACCGG CGACAAGTCG AAGTTCGACA ACTCCGGCGC CTGGCTGGAC
AGCCTCGCGG CCTCCGGCGC CTCGCCGGAC GGCAGCAAGC AGTACGCGAT CCCCTACTAC
GCCGGCTCCC GCGTCCTGAT CTACCGCAAG GACCTGTGGG CCGCGGCGGG CGTGACCACC
GCGCCGACCA CGCTCGACGA GCTCAAGGCG GACCTGGACA AGGTCAAGGC GGCCAACACC
TCCACCGCCA ACTTCTCGGC GCTCTACCTG CCGGGCCAGA ACTGGTACAC CGCCATCTCC
TTCGGCGCCG GCAGCTACGG CGTCAACGGC GTCATCGCCA AGTCCAACGG CAGCAGCTGG
ACCGGCACGA TGACCGACCC GAAGTTCCTG ACGGGCATCT CGACGTGGGA CAGCCTGCAG
AAGAGCTACT CGGTCGGCGG CGCGACCAAG GACGAGACCG ACCAGGACGC GCTGATGGCC
AAGGGCAACA TCTCGGCCAT CATCGGCAAC GGCTGGGAGG CCGCGCAGGT CTACGACCCC
AAGGTCGGCG ACCCGACGCT GAAGGACAAG CTCGCTGAGA TCGCCGTCCC CGGCGTGACC
GCGGACGCCC CGACCCCGGC CTTCCTGGGC GGCTCGAACC TGGCGGTCCC GTCGAAGGCG
GCGAACTCCA AGCTCGGCGA GGAGTGGATC CGGATCTTCA CCGACACCGC CAGCATGAAG
CTGCTGGCGG CCAAGGCGAT CCCGAACAAC AAGACCCAGA TCGCCGACTA CATCGCCGCC
GACCCGGCCA ACCAGGCCAC CGGCGACGCG GCCAAGGGCG TCACCTGGTT CATCCCGAAC
TCGCCGAACT GGGCGCCGGC GGACGAGACG CAGCTGAAGA CCGCGTTCGG CCAGATCGCC
TCCGGGCAGG ACCCGGCGAC CGTGCTCAAG GGATTGCAGG ACAGCATCCT GAAGGACCTG
AACAGCTGA
 
Protein sequence
MRRKLIALSS AAVAVALMAS ACSSSKSSGS DAAGSGGTTA APSSNQLGTG GTGSTGSAIT 
TDGKGKTVNI WLMQDAQKGW QNVVDQANQR FTAETGAQVK INWQTWTNYG QTVDAAIGSS
SAPDAIELGN TQTAKYIGAD QLVDLTGDKS KFDNSGAWLD SLAASGASPD GSKQYAIPYY
AGSRVLIYRK DLWAAAGVTT APTTLDELKA DLDKVKAANT STANFSALYL PGQNWYTAIS
FGAGSYGVNG VIAKSNGSSW TGTMTDPKFL TGISTWDSLQ KSYSVGGATK DETDQDALMA
KGNISAIIGN GWEAAQVYDP KVGDPTLKDK LAEIAVPGVT ADAPTPAFLG GSNLAVPSKA
ANSKLGEEWI RIFTDTASMK LLAAKAIPNN KTQIADYIAA DPANQATGDA AKGVTWFIPN
SPNWAPADET QLKTAFGQIA SGQDPATVLK GLQDSILKDL NS