Gene Caci_2783 details


Gene Information

Locus tag: Caci_2783
Symbol:
ID: 8334132
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 3195420
End bp: 3196715
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 65%
IMG OID: 644955931
Product: extracellular solute-binding protein family 1
Protein accession: YP_003113537
Protein GI: 256391973
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
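
The coordinate and length fields of this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function names `span_length` and `coding_codons` are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3195420
END_BP = 3196715
GENE_LENGTH_BP = 1296
PROTEIN_LENGTH_AA = 431


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3196715 - 3195420 + 1 gives 1296 bp, and 1296 / 3 - 1 gives 431 aa, matching the listed lengths.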


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.185157
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCCTAT CGCGCAGCAG TCGGCGGAGG GCGGTCGCGA TATCGGCCGC GCTCGCCGCC 
GTGTCGTTGG GTGTGGCCGG GTGCGGCAAG TCGACCGCGT CGGGTTCGTC GTCGAAGGTG
CTCAAGCTCT GGCACTACGA GGCCGCCGAC AGCGCGATGG GTGTCGCGTG GAACCAGGCG
ATCAAGGAGT TCGAGCAGAA GCACCCCGGC GTGACGGTGC GGTTCGAGCA GAAGACCTTC
GAGCAGTTGG AGAAGACGGC GCCGATGGTG CTGAACTCCT CCGACGCCCC GGACATCTTG
GAGTACAACA AGGGCGACGC GACCGCGGGG CTGCTGGCCA AACAGGGACT GCTCACCGAC
CTGTCCGGGG CGGTCGCCCA GTACGGGTGG GACAAGAAGA TCACCGGCAA CATCGCCGCG
ACCTCCCGGT ACACGAACGG GGTCATGGGC TCCGGCCCGT GGTACGGCAT CCCGGACTAC
GGCGAGTACG GCATGGTCTA TTACAACAAG GACATGTTCG CCAAGTACGG CGTCAAGGTG
CCCACCACGT TCGCGGAGTT CACCGCCGCC ATGGACACCT TCGTCAAGGC CGGCGTCACG
CCGCTGGCCA GCGCGGGCGC GGAGTACCCG GCGCAGCAGT ACCTGTACAA CCTCGCCCTG
TCCAAGGCCG ACCAGAACTG GGTCAACCAG TACCAGATCG CCGGCAAGGC GGACTTCAAG
GACGCGGCTT GGACCGGCGC CGCGACCACG CTGGCCGACT GGGTCAAGAA GGGCTACATC
GCCAAGGACT CGGTGAGCCA GAAGGCCACC GACATGGGCA ACGCCTTCGA ATCCGGCAAG
AGCCCGATGA TGGTCTCCGG CAGTTGGTGG TACGGCACCT TCGAGTCCGA GATCAAGGGC
TTCGCCTGGG ACACGTTCCT GTGGCCCGGC AACAAGCTGG TCCCCGGCTC CGGCGGCAAC
CTGTGGGTGA TCCCGAAGAA CTCCAAGAAC GCCGCCCTCG CCGAGGACTT CATCGACATC
ACGCTCCAGC CGGACATCCA GGCCCTGCTG GCCAACAAGG GGGCGGTCCC GGTCGCGGCC
AACGCCTCGG ACATCACCGA TCCGAAGGCC AAAGAGCTCG TTCAGAACTT CCAGACCCTG
CAAGCCTCCA ACGGCCTGGC GTATTACCCG GACTGGCCGG TACCAGGGTT CTACGACAAC
CTCACCGCCG CGACCCAGGA CCTCATGAAC GGCAAGAGTC CTGACTCGGT CTTGAGCGGT
CTGCAGAGCG CTTACAACCA AGGCCTGTCG CAGTAA
 
Protein sequence
MFLSRSSRRR AVAISAALAA VSLGVAGCGK STASGSSSKV LKLWHYEAAD SAMGVAWNQA 
IKEFEQKHPG VTVRFEQKTF EQLEKTAPMV LNSSDAPDIL EYNKGDATAG LLAKQGLLTD
LSGAVAQYGW DKKITGNIAA TSRYTNGVMG SGPWYGIPDY GEYGMVYYNK DMFAKYGVKV
PTTFAEFTAA MDTFVKAGVT PLASAGAEYP AQQYLYNLAL SKADQNWVNQ YQIAGKADFK
DAAWTGAATT LADWVKKGYI AKDSVSQKAT DMGNAFESGK SPMMVSGSWW YGTFESEIKG
FAWDTFLWPG NKLVPGSGGN LWVIPKNSKN AALAEDFIDI TLQPDIQALL ANKGAVPVAA
NASDITDPKA KELVQNFQTL QASNGLAYYP DWPVPGFYDN LTAATQDLMN GKSPDSVLSG
LQSAYNQGLS Q
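
The gene and protein sequences above are printed in space-separated blocks of ten. The sketch below, which is not part of the original record, shows one way to strip that layout, compute a GC fraction, and translate the coding sequence so that it can be compared with the listed protein. It assumes the sense-codon assignments of translation table 11 (which match the standard code) and does not handle alternative start codons, which this gene does not need since it begins with ATG. The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence, dropping one trailing stop codon if present."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[s[i:i + 3]] for i in range(0, len(s), 3)]
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed in this record.
    print(translate("ATGTTCCTAT CGCGCAGCAG TCGGCGGAGG"))  # MFLSRSSRRR
```

The first thirty bases translate to MFLSRSSRRR, which agrees with the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` allows the same comparison against the complete protein and the reported GC content.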