Gene Caci_2139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_2139 
Symbol 
ID8333484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2424627 
End bp2425778 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content73% 
IMG OID644955289 
Producthypothetical protein 
Protein accessionYP_003112899 
Protein GI256391335 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGTGC GGTCCGCGCG GTCCGCATCG CCGGTCGAAG GGCTCGCCGC CGCGACCGAA 
CTCCTGGCTT GTGAAGGGGT TCTGAGCGCC GCGCAGATAC GGACCACTGT CGAGGCGATA
GCCGCGCTCC AACGTCCCGA CGGCGCGATC CCGTGGTTCC ACGGCGGCCA CCTGGACCCC
TGGGACCACA TCGAGGCGGC GATGGCGCTG GACGCGGCCG GGCTGGCGGA CCGCGCGCTG
GCCGCCTACC GCTGGCTGGC GGCGTCGCAG AACCCTGACG GTTCCTGGTA CGCGGCGTAC
GCCGACGCGC CGGGCGGCGT CACCGAGCCG ACCAACCGGC TGCGCGAGAC CAACTTCAGC
GCCTACATCG CGGTCGGCGT GTGGCACCAC TGGCTGGCGA CCGGCGACGA GGACTTCCTG
GCCCAGATGT GGCAGCCGGT GCAGCGCGCG ACGGACTTCG TGCTGAGCCT GCAGACCGCC
GGCGGGGAGA TCCTGTGGTG CCGCGACGAG CAGGGACGCG AAGCCGACGA GGCGCTGCTG
ACCGGCTGCT CGTCGATGTA CCAGGCGCTG CGCTGCGCGC TGGCGGTCGC CGAGCGCCTC
GGACGCGATC GCCCGGACTG GGAGCTGGCG TGCGGCCGGT TGGGGCACGC GCTCACGGCG
CACCCGGAGC GCTTCGCGGA CAAGGGCACG TACTCGATGG ACTGGTACTA CCCGGTGCTC
GGCACGGCGC TGCGCGGAGC GGCGGCCGAG CAGCGGATCG CCGAGGGCTG GGACGATTTC
GTGGTCCGGG ACCTCGGCGT GCGCTGCGTC TCGACGAACC CGTGGGTCAC CGGCGGCGAG
ACCTGCGAAC TGGCACTGGC GCTGTGGGCG ATCGGCGACA CCGAACGGGC CCGGATGCTG
CTGCGCGACA TCCAGCATCT GCGCGACGAC GCCGACGGGA TGTACTGGAC CGGCCGCGTC
TTCGAGGCGG ACGGCGCGGG GCACGAGCCC GCGCTGTGGC CGGTGGAGAA GACCACGTGG
ACGGCCGGCG CCCTGCTGTT GGCGCTCGCG GTGCTCGCCG AGGAGAAGGC GACGGTCGCG
GTGTTCGGGG GCGACGGGCT GCCCGAAGGG TTGCCGGTGG CGTGCTCGGT CGCGGAGTGC
GTCGCGGCTT GA
 
Protein sequence
MTVRSARSAS PVEGLAAATE LLACEGVLSA AQIRTTVEAI AALQRPDGAI PWFHGGHLDP 
WDHIEAAMAL DAAGLADRAL AAYRWLAASQ NPDGSWYAAY ADAPGGVTEP TNRLRETNFS
AYIAVGVWHH WLATGDEDFL AQMWQPVQRA TDFVLSLQTA GGEILWCRDE QGREADEALL
TGCSSMYQAL RCALAVAERL GRDRPDWELA CGRLGHALTA HPERFADKGT YSMDWYYPVL
GTALRGAAAE QRIAEGWDDF VVRDLGVRCV STNPWVTGGE TCELALALWA IGDTERARML
LRDIQHLRDD ADGMYWTGRV FEADGAGHEP ALWPVEKTTW TAGALLLALA VLAEEKATVA
VFGGDGLPEG LPVACSVAEC VAA