Gene Caci_3708 details

Gene Information

Locus tag: Caci_3708
Symbol:
ID: 8335061
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4163869
End bp: 4165518
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 69%
IMG OID: 644956848
Product: alpha amylase catalytic region
Protein accession: YP_003114451
Protein GI: 256392887
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
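
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes a single terminal stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the gene record above.
start_bp = 4163869
end_bp = 4165518
gene_length_bp = 1650
protein_length_aa = 549

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted in the protein length.
codon_count = span_bp // 3
residues = codon_count - 1

print("span (bp):", span_bp)
print("whole codons only:", span_bp % 3 == 0)
print("residues excluding stop:", residues)
```

Under these assumptions the span matches the listed gene length, and the codon count minus one matches the listed protein length.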


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0503251
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.00685051
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAAGAAA CCACCACCGG CCCGACGACG ACCGCCTGGT GGCGTACCGC GTCCATCTAC 
CAGATCTACG TGCGCAGCTT CGCCGACGCC GACGGGGACG GGATCGGGGA CCTCGCCGGG
ATCCGTTCCC GGCTGCCGTA CCTGCGCGAT CTCGGTGTGG ACGCGCTGTG GCTGACGCCG
TGGTACGTCT CGCCGATGGC CGACGCGGGG TATGACGTCG CAGACTATTG CGACATCGAT
CCGGTCTTCG GGGATCTGGC GCAGGCCCGA AAGCTGATCG ACGCGGTCCA CGGTTTCGGA
ATGCGGATCA TCATCGACAT CGTCCCCAAC CACTGCTCGG ATCAGCACCC TTGGTTCCAG
GCGGCGCTGG CGGCCGGTCC CGGTGCGGCC GAGCGCGAGC GGTTCTGGTT CCGTCCGGGA
CGCGGTGCGG ACGGCTCGCT TCCCCCCAAC GACTGGCAGT CCTACTTCGG CGGCCCGGCC
TGGACACGGA TCACCGAACC CGACGGCACG CCGGGACCCT GGTTCCTGCA CATGTTCACC
CCCGAGCAGC CCGACCTGAA CTGGGAGGAC CCCGGCACGC TCGCCGCGTT CGAGCAGATC
CTGCGCTTCT GGCTCGATCG CGGCGTCGAC GGCTTCCGCA TCGACGTCGC CCACGGCTTG
ATGAAGAAGC AAGGACTCCC GGACGTCGGT CCCACCCCGA TCCCCCACGA CCTCCCCTAC
CAGGACCGCC CCGAGGTACA CGACGTCTAC CGCGCCTGGC GCCGCGTCAC CGACTCCTAC
CCCGGCGAGC GCGTGATGGT CGGCGAGGTC TGGCTCCCGA CCCCCGAGCA GTACACCCGC
TACCTCCGCT CCGACGAACT GCACTCGGCC TTCAACTTCG AATTCCTGTG CAGCGCCTGG
GACAGCGAAG CGATCCGCAG GGTCATCGAC GACACCCTGG CCTCCCACGC CGCCGTCGGA
GCACCCCCGA CCTGGGTCCT GTCGAACCAC GACACGATCC GCCACGTCAC CCGCTACGGC
CGCGAGAACA CCGCCTTCGA CATGGGCGAC AAGCGGCACG ACGACCCCTC CGACCCCGTC
CTCGGCACCC GCCGCGCCCG AGCCGCAGCC CTCCTGACCC TCGCCCTCCC CGGCGGCGTC
TACATCTACC AGGGCGACGA ACTCGCCCTC CCCGAGGTCC GGGACGTACC CCCGGACCGC
ATCCAGGACC CCACCTGGGA ACGCTCCGGC CACACCGACC GCGGCCGCGA CGGCTGCCGC
GTCCCCCTGC CCTGGTCCGG CGAAGCACCC CCCTTCGGCT TCTCCGACCC CGAAGCGCAC
GCAGACCCCT GGCTCCCGCA ACCCGCGTCC TGGAAGGAGA CCACGGCAGC GGTCCAGAAT
CAGGACCCTG ACTCAACACT GAACCTCTAT CGCGAGGCAC TGACCCTGCG CCGCGAAACG
ATCTCCAACC TCCCCACCGA CGTCACCTGG ATCGACGCGG GAGCCGACGT CGTGGCGTTC
ACCCGCTCGC AGGACTTCAC GTGCATCGTG AACTTCTCAG CCACCCCCCT CGACCTGCCC
CCCGCGCATC GCGTCCTGCT GAGCAGCGAC GTGGTGAGCG GCGGAAAGCT CGCCCCGAAC
GCCGCGGTGT GGCTTGGAGC CCGGGCCTAG
 
Protein sequence
MQETTTGPTT TAWWRTASIY QIYVRSFADA DGDGIGDLAG IRSRLPYLRD LGVDALWLTP 
WYVSPMADAG YDVADYCDID PVFGDLAQAR KLIDAVHGFG MRIIIDIVPN HCSDQHPWFQ
AALAAGPGAA ERERFWFRPG RGADGSLPPN DWQSYFGGPA WTRITEPDGT PGPWFLHMFT
PEQPDLNWED PGTLAAFEQI LRFWLDRGVD GFRIDVAHGL MKKQGLPDVG PTPIPHDLPY
QDRPEVHDVY RAWRRVTDSY PGERVMVGEV WLPTPEQYTR YLRSDELHSA FNFEFLCSAW
DSEAIRRVID DTLASHAAVG APPTWVLSNH DTIRHVTRYG RENTAFDMGD KRHDDPSDPV
LGTRRARAAA LLTLALPGGV YIYQGDELAL PEVRDVPPDR IQDPTWERSG HTDRGRDGCR
VPLPWSGEAP PFGFSDPEAH ADPWLPQPAS WKETTAAVQN QDPDSTLNLY REALTLRRET
ISNLPTDVTW IDAGADVVAF TRSQDFTCIV NFSATPLDLP PAHRVLLSSD VVSGGKLAPN
AAVWLGARA
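
The gene and protein sequences above are related by translation table 11, and the GC content is the fraction of G and C bases in the gene sequence. The sketch below shows one way to check both from the space-separated blocks as printed. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Table 11 assigns the same amino acids to codons as the standard code, so a standard codon table is used here; the alternative start codons that table 11 permits are not handled, which does not matter for this gene because it starts with ATG.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
# Assumption: amino-acid assignments of table 11 equal those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence, copied from the listing above.
first_bases = "ATGCAAGAAA CCACCACCGG CCCGACGACG"
last_bases = "GCCGCGGTGT GGCTTGGAGC CCGGGCCTAG"

print(translate(first_bases))  # MQETTTGPTT
print(translate(last_bases))   # AAVWLGARA
```

Passing the full gene sequence to `gc_fraction` and `translate` in the same way gives values that can be compared against the GC content and protein sequence listed in the record.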