Gene Caci_5029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5029 
Symbol 
ID8336383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5760980 
End bp5763826 
Gene Length2847 bp 
Protein Length948 aa 
Translation table11 
GC content67% 
IMG OID644958128 
Producthypothetical protein 
Protein accessionYP_003115730 
Protein GI256394166 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1554] Trehalose and maltose hydrolases (possible phosphorylases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0984885 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACACA GAACACGGCT GCGGCGCACT GCCGCCGCCG GCGCTGTCGC AGCCCTCGTG 
GGAACGATGC TCACCACGAC CCTGACCGGT CCGGCGAGGG CCGACTCGGT CACCGCGAAC
CAGGCCTGGC GCATCGCCCA GCAGTACACC GGCGTCTGGA CGAGCCCGCC GTCCGCCCTG
ACCAACGGCG AGACGGTGGA CGCCCCGATG CTGGGCAACG GCGACATCGG CGTGGCGATC
GGCGGGTCGA TCGCGAATCA GACCATGTAC CTCGGCAAGA ACGACTTCTT CTCCGGGTCC
GCCCACGCGA TCAAACCGCT GGGACGGATC GTGGTCACCG CGGCCGGGCT GAACGGCTCG
TCCTACCACG TCGTCCAGGA CATCGCGCAC GCCGAAGTGC GCGGCACGTA CACCCTGGGC
AGCCAGACGC TGAGCACCAC GAGCTGGGTC GACGCGAACT CCGGCATGTA CGTCACCTCC
TTCGCCCTGA CCGGCGGCAG CGCGCAGAGC ATCGGCATCG CGCTGCAGAA CGGGAGCGGC
GGCACCCCGA GCGTCAGCAC CAGCGGCAAC GACCTGGACG CCGACGTCGC CGCGGACACC
GGAACCGGCA GCGACCCGCA CGCCCGGATC GCCGCGCGCA CGATCGGGCA GACCCAGTCG
ATCTCCGGCA ACAAGATCAC CCTGACCATC CAGCCGGGGA CCACGTCCAC TCTCGTGGCC
GGGATCGTCT CCAGCATCGA CAGCTCTTCG TGGCAGTCCG GCGCCGATGC GCTGGTCGGC
TCGCTGGCTC AGGCAGACGT CGCCAACCAC AACGCCGCGC ACCGTTCCTG GTGGCAGAAC
TACTGGCAGC AGTCCTACGT CGAGATCCCC GACAAGACGG TGGAGAAGAG CTGGTACGGC
TCGCTCTACC TGCTCGGCTC CGTCTCGCGC GCCGGGAAGT ACGCTCCCGG GCTGTGGGGC
AACTGGATCA CCGGCGCGAT GAACTGGAAC GGTGACTACC ACACCAACTA CAACTACGAG
GCGCCGTTCT ACGCCGCCTT GTCCACCAAC CACATCGCGC AGATGGCCGC CTATGACCAG
CCGGTGCTGG ACTGGCAGTC CGGCGGCCAA TCGCTGGCGT CGCAGAACGG TTTCTCCGGC
GTGCTGTACC CGGTCGGCTT GTCGCCCAAG GGCACCAGCG CCGACATGAA CCTGCACAAC
CAGAAGTCCA ACGCCGCGAA CCTCGCCAGC GACATGGTGA TGCGCTTCGA GCACACCGGC
GACACGTCGT ACGCGACCAC CGTCTACCCG TGGCTGAAGC AGGTCGGGCT GTTCTGGCAG
AACTACCTGA CCTGGGACGC GGCGAACAAC CGGTATGTCA TCACCAACGA CGCCCCGCAC
GAGGACCAGT CCTACCCGCA GACCAACAGC GGGCTGTCGC TCGGGCTGGT GCACCTGCTG
TTCCAAGGCC TGATCGACAT GAGCACGGCG CTGAATCAGG ATGCTTCGAC CCGCGCCACC
TGGCAGAACA TCGAGTCTCA TCTCAGTGCC CTGCCCACGA TGTCGCTGAA CGGGCAGACC
ATCCTGCGCG AGACCGAGGT CGGCAGCGAT TTCATCAACG ACGGCAACGA CATCGACTCC
CAGGCGATCT ACCCCGGCAG CTTGATCGGC CTGGACAGCG ACGCGGCCTC GCAGCAGAAC
GCCCGCAACA CCATCGGCGC GCTGACCAAC GCCTGGCACG GCGGCAACGC GCCGGCCACG
TTCTACGCCG CGGCGGCGCG CGTGGGCTAC AACCCGAGCA CGATCCTGTC CAACCTGGAC
TCCGAAGCCG CGAACAACGC CTATCCCAAC ATGGCGATCC ACCACAACGG CGGCGGCATC
GAGAACATCA ACGTCACCAC CTCCGGGCTG GACGAGATGC TGCTGCAGTC CTTCCAGAAG
GACGTCAAGG TGTTCGCCGA CTGGCCGGCG AACACCAACG CGAAGTTCGG CGACCTGCTC
GCGTACGGCG ACTTCCTGAT CTCCTCCAGC AAGTCCGGCA ACGCCGTCCA GTACATCCGG
GCCGTCAGCC AGAAGGGCGG AAGCCTGACC GTCACCAACC CCTGGTCCGG CAGCGTCGAG
GTCTACCGCA ACGGCACCGA CACCGGCGCC GTGTCCGGGG CGAAGCTCAC GATCGCGACC
TCGGCCGGCG ACACGATCGA CCTCGCCCCG GCCGGTACCT CGCTGGCGAC CATCCAGTCC
GAGCTGTCCC AGCCGCTGCA GACCACCTCC AGCGGCAGCT TCAGCTCCGG ATTCGAGAGC
AGCGACCCGG CGGTGAGCTG GAGCGACACG GTCGACAGCA GCGGCGGCGG CAGCACGGGC
GTCACGGGGA TCTGCTGCGG CGCACCCGGC CCGGAAGCCG GAGTCCGCAC CGGTGAGACT
TCGCACACCG GGTCCAGCTC GTTGATGTAC TCCGGATCCG CGCAAGGCGG CACCAACGAC
TATGCGTACC TGAAGGTCTA TGACCTCAGC GGCAGTCCGC TGGCGATCGG ATCCGGGAAG
ACCCTCGGCT ACTGGATCTA TCCCCAGAGC AACGCCACCA GCACATGGGT CCCAGCCGGT
TCCACGAACA GCAGTTGCGT CGCCGTCGAC ATGGTCTTCA CCGACGGCAG CACCCTGAGA
GACTCCGGCG CCGTGGATCA GAGCGGCACC AAGATCCATC CGGCGAACCA GTGCGGGCAT
CTGACGCTGG ACGCCTGGAA CCATGTCACG GTCAATCTGG GGACGAACAA CGCCAACAAA
CAGATCAGCC GGATTCTGGT CGGCTACGAC CATCCGAACT CCACCGGCGG TTACCGCGGC
TACGTCGACG ATCTGACCGT CAGCTGA
 
Protein sequence
MRHRTRLRRT AAAGAVAALV GTMLTTTLTG PARADSVTAN QAWRIAQQYT GVWTSPPSAL 
TNGETVDAPM LGNGDIGVAI GGSIANQTMY LGKNDFFSGS AHAIKPLGRI VVTAAGLNGS
SYHVVQDIAH AEVRGTYTLG SQTLSTTSWV DANSGMYVTS FALTGGSAQS IGIALQNGSG
GTPSVSTSGN DLDADVAADT GTGSDPHARI AARTIGQTQS ISGNKITLTI QPGTTSTLVA
GIVSSIDSSS WQSGADALVG SLAQADVANH NAAHRSWWQN YWQQSYVEIP DKTVEKSWYG
SLYLLGSVSR AGKYAPGLWG NWITGAMNWN GDYHTNYNYE APFYAALSTN HIAQMAAYDQ
PVLDWQSGGQ SLASQNGFSG VLYPVGLSPK GTSADMNLHN QKSNAANLAS DMVMRFEHTG
DTSYATTVYP WLKQVGLFWQ NYLTWDAANN RYVITNDAPH EDQSYPQTNS GLSLGLVHLL
FQGLIDMSTA LNQDASTRAT WQNIESHLSA LPTMSLNGQT ILRETEVGSD FINDGNDIDS
QAIYPGSLIG LDSDAASQQN ARNTIGALTN AWHGGNAPAT FYAAAARVGY NPSTILSNLD
SEAANNAYPN MAIHHNGGGI ENINVTTSGL DEMLLQSFQK DVKVFADWPA NTNAKFGDLL
AYGDFLISSS KSGNAVQYIR AVSQKGGSLT VTNPWSGSVE VYRNGTDTGA VSGAKLTIAT
SAGDTIDLAP AGTSLATIQS ELSQPLQTTS SGSFSSGFES SDPAVSWSDT VDSSGGGSTG
VTGICCGAPG PEAGVRTGET SHTGSSSLMY SGSAQGGTND YAYLKVYDLS GSPLAIGSGK
TLGYWIYPQS NATSTWVPAG STNSSCVAVD MVFTDGSTLR DSGAVDQSGT KIHPANQCGH
LTLDAWNHVT VNLGTNNANK QISRILVGYD HPNSTGGYRG YVDDLTVS