Gene Caci_2837 details

Gene Information

Locus tag: Caci_2837
Symbol:
ID: 8334186
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 3242886
End bp: 3246335
Gene Length: 3450 bp
Protein Length: 1149 aa
Translation table: 11
GC content: 65%
IMG OID: 644955981
Product: hypothetical protein
Protein accession: YP_003113587
Protein GI: 256392023
COG category:
COG ID:
TIGRFAM ID:
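
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3242886, 3246335)
print(length)                  # 3450
print(protein_length(length))  # 1149
```

Both results match the Gene Length and Protein Length values reported in the record.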


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00780437
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.000194456
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCCGCGC TGGAAGCCAG AGCGAAGGCG CTGGCCGACG CCGCCAACTT CGTCGCGGGA 
AGCGGCGCCA CCTTCAACAC CGACCTCGAT GACGCCGGCC TGCGCGACAA AGTCATCGAG
GCCGTTCGGA CGGCCGAGCG GAACATTGCC GTCATCGTCG GCATCGACCT GGACGACCCG
GATCTCCGCG AAAAGGTCAT CGAGGCCGTC AGCGCGGCCG AAAAGGGCAT CAACATCATC
GTCGGGATGG ATGTCGATGC CGACGGCTTG AAGGAGCGAG TCAAGGCCGA GGCGGATGCT
GCCGGCGCCG GCGAGAAGAT CAAGGTCCGG GTCGAGTCCG ACGGAACCAG CCTGGAGCAG
GACGTTGCAT CCAAGGCGGG ACGCGTAAAG CCCCAGCCCA TCAAGGTGCC GATCCAGTCG
GACGCCGATA AGTTCGAGGC CGAACTGCGC GCGTCGTTCG CGGAGGGCGA GAAGAACGCT
GCGGCCGCCG AAAAGGCGAT GAACCAGTCC TTCACTGCGA TGCAGACCGG GGTGCGCGCG
CTCCGGTCGG CCATGGCCGA GCTTGAGCCG GCAACGCAGG ATGCTGAAGA TTTTGAAACG
TCCTTCCGCA AAGCCATGGA CGAGGGCGAA AGGGTTTCAC AAGAGGCCGA CCGCGTCCTG
CGCCAGTCGT TCACCTCCAT GGAATCCGGC TCGCGGACTT TGCGTGCGGC GATGACCGAG
CTCCAGCCGG CCGCCGAGGG CGCTGGTCAG GCCGCAGCGA ATGCGGGCAG CGGTTTCAAT
ACGAGCGCCC TCAGGATGTC CTCGCTGATC GGGGCGGCCT TGGCGCTCGG CCCAGCGCTG
GCAGCGATCC CGGCTGTAGT GGGCGCCGCG GGCGCGGGCT TCGCAACGCT GGGCCTGGGG
ATGGCCGGGC CGATTGCCGC GCTGCGGGAC TACGGGGCGC AGAGTCAAGC CACGGGCCAG
TCCTCGGCGC AGCTCGCGGC AACGGCCTTC AGTAACGCTG TTTCGATTCG CAATGCCGAG
CAGGCAATCG CCGACGCGAA GCGGCAGGCC GCGATTTCGG CGATTAACTC GGCCCAGTCG
ATCGAGTCTG CGGAACAGGG CGTTACCGAC GCCGAGCGGC AGGCGGCGAT CTCGGCTCAA
TCGGCAGCAG ATGCCGTGGC TTCGGCCGAT CAGCGCCTCG CGAACGCGCA GGAGTCGTTG
ACGCAGGCAC AGGAGTCGTT GACGCAGGCT CAGAAGGACG GCGTCAACGT CCTCAAGGAC
TTGAATCTGG CTTCGGCCGA TGCGGCGAAC TCTGTCGCGG ACGCGCAGAA CGCGGTCATT
GACGCGCAGG CGGCCTACGA CAAGGCCAAG GGCAACAGCC TGCTGACTGA TCAGCAGAAG
AAGGAAGCGC AGCAGCAGCT GATCGACGCC CAGCAACACC TGACGGACGC GCAGCAGAAG
GCGCTTGAGG CGCAGCAGGC CGCGAACGAC GCCAACCAGA AGGGCGTGGA CGGCAGCACG
GCGGTCGTCG CGGCGCAGCG GCAGGTTGTC TCGGCCACGC AGGGCGTAGC CGACGCTCAG
CTTGCGGCAA CTCGCGCCCG GGAGGCGCAG GCCAACCAGG AGATCTCAAG CAACCAGTCG
GTCGCGAAGG CTCAGCAGTC GCTGGCTACC GCGATCCGCG ATGCGGCGGA ACAGCAGATC
TCGTCGAATG AATCGGTGTC CAAGGCGGTT CAGGCACTCA AGGACATGCA GGAGCAGCAG
GCTCTGTCTG CCGCGGCTGC GGCATCTTCG GGGTCTGCGG CTGCGAACAA ATTCGCGCAG
GATATGGCGA AGTTGACCCC GGCCGGCCGG GATTTCGTCA ACCAGCTGAT TTCGATGCGG
GGCGGTCTGC ACGATCTCGA GGCCACTGCG CAGACGACGC TGCTGCCCGG CTTCACCACG
CTCCTGAAGG ACGTGGGCGG TTCGAACGGC CTCGGCTCGC TGTTCAACAA GGCCGTCGGC
GACATGGGCA CGATCATCGG CGGCACGGCG ATCCAGTTCG GGAACCTGAT GACGTCGCCC
GCGTTCAAGG GCCAGCTGAC GCAGGTGCTG AAGGACGGCG CGGGGTTCGC GAAGGATCTC
GGCGACGGAC TTGTGGCTTT GACGGGCGGC CTGACTAAGG CGGCGTCGCA GGCAGGCCCG
ATAGTGTCCG GGCTCGGCGG CGGAATCAAA ACCTTGATGT CGTCCGGGAT TCCCGACTTC
TTCAGCGGTC TGGTCACCAA TGCGGGCGGC GCCGGCCAGT CGATACAGGC CATCTTCACG
ATCGTTTCCA ACCTTGCCGG TCCGCTGGGC ACGATAGCCG GCGCGTTCTC TGCGGCACTT
GCTCCGGCGC TGCAGGTTCT GGACTCCCCG CAGGTCCAGC AGTCACTACA GTCGATCGCG
ACTTCAATTG CGCAGATCCT GATCGTCCTG TCGCCGGTGG TCACAATGCT CGCGCAGGGT
CTGGCAGGGG CGCTGCGGAT CGTGGCGCCG CTGATGCAGT CGCTGGCGAA GTTCATCCAG
GACAACCAGC AGTGGGTGGT GCCGCTGGCC AAGGGGATCG CGATTGCCAC GATCGCTTTT
GTCGCTTTCA ACGCAGTGCT CGCTGCGAAC CCCGTCCTGC TGGTGGTAGC CGCAATCGCG
GCCCTGGTTC TCGGTGTGGT CTACGCATAT GAGCACTTCA AGATATTCCG CGATGTCATC
CACGATGTGT GGGTCGTTAC GAAGGCCGAG TTCGACTTCT TCCTGGGCTT CATAAAGCGG
TGGTGGCCGG AGCTGCTGGC ACCGTTCACC GGCGGCGTGT CAGAGATCAT CGCCCACTGG
GACGCGGTCG TCGACTTCGT GAAGAAGCTA CCGGGCCGAC TGGTCTCCGC GGGCGCGCAC
ATGTGGGACT GGATCTCTCA GAAGTGGGAC GACGACGTAG CCGCGCCGGT CAGCAAGGCC
TTCGACGGCT TCATTCACAC AGTGACCGGG CTGCCGGGCA AGTTGGCCAG GGCCGGCGCC
GGCATGTGGG ACTGGATCAA GGAAGAGTTC GTCGGCGCCC TCAATGCAAT TGCCAACCTG
TGGAACCAGT TGCACTTCAG CACGCCGAGC TTCCACATCC CGATTCCCTT CAGCAGCGGC
ATCAACGTCG ACTCGATAAC CGTCGGGGTA CCGCCCATCG GCCCTTTCAA GGCCGCCGGC
GGCCCCATCT GGGGCGGCCT GTCCGCGATC ATCGGCGAAG CAGGGACCGA ACTTCTGAAA
CTGCCGACCG GCACCCAGGT CATGCCCCAT GCCAACACTC AATCAATGAT CGCCCAGGGC
GGCCTTGGAT CGTCCGGCGG CGTGCTTCAG ATCGAGTGGG TCGGCGGCAA CGGCGGCGAC
GAGCTCATGA CGTGGATCCG CAAGAACATC CGCATCCGCC ACGGGTCGGA TCCCAACAGC
GTCCAGAAGG CCCTCGGGCA GAGCTTTTGA
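
The sequence above is printed in space-separated blocks of ten bases, so the grouping has to be stripped before any per-base statistic is computed. The following is a minimal sketch of how a GC fraction, such as the 65% reported in the record, could be recomputed from this layout. The function name `gc_fraction` is hypothetical, and the sample input is only the first two blocks of the gene sequence, not the full gene.

```python
def gc_fraction(grouped_sequence: str) -> float:
    """Fraction of G and C bases in a sequence that may contain spaces or newlines."""
    # Remove all whitespace used for block and line formatting.
    bases = "".join(grouped_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two ten-base blocks of the gene sequence, used only as a small sample.
sample = "ATGGCCGCGC TGGAAGCCAG"
print(gc_fraction(sample))  # 0.7
```

Passing the whole gene sequence as one string, line breaks included, would give the figure for the full gene.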
 
Protein sequence
MAALEARAKA LADAANFVAG SGATFNTDLD DAGLRDKVIE AVRTAERNIA VIVGIDLDDP 
DLREKVIEAV SAAEKGINII VGMDVDADGL KERVKAEADA AGAGEKIKVR VESDGTSLEQ
DVASKAGRVK PQPIKVPIQS DADKFEAELR ASFAEGEKNA AAAEKAMNQS FTAMQTGVRA
LRSAMAELEP ATQDAEDFET SFRKAMDEGE RVSQEADRVL RQSFTSMESG SRTLRAAMTE
LQPAAEGAGQ AAANAGSGFN TSALRMSSLI GAALALGPAL AAIPAVVGAA GAGFATLGLG
MAGPIAALRD YGAQSQATGQ SSAQLAATAF SNAVSIRNAE QAIADAKRQA AISAINSAQS
IESAEQGVTD AERQAAISAQ SAADAVASAD QRLANAQESL TQAQESLTQA QKDGVNVLKD
LNLASADAAN SVADAQNAVI DAQAAYDKAK GNSLLTDQQK KEAQQQLIDA QQHLTDAQQK
ALEAQQAAND ANQKGVDGST AVVAAQRQVV SATQGVADAQ LAATRAREAQ ANQEISSNQS
VAKAQQSLAT AIRDAAEQQI SSNESVSKAV QALKDMQEQQ ALSAAAAASS GSAAANKFAQ
DMAKLTPAGR DFVNQLISMR GGLHDLEATA QTTLLPGFTT LLKDVGGSNG LGSLFNKAVG
DMGTIIGGTA IQFGNLMTSP AFKGQLTQVL KDGAGFAKDL GDGLVALTGG LTKAASQAGP
IVSGLGGGIK TLMSSGIPDF FSGLVTNAGG AGQSIQAIFT IVSNLAGPLG TIAGAFSAAL
APALQVLDSP QVQQSLQSIA TSIAQILIVL SPVVTMLAQG LAGALRIVAP LMQSLAKFIQ
DNQQWVVPLA KGIAIATIAF VAFNAVLAAN PVLLVVAAIA ALVLGVVYAY EHFKIFRDVI
HDVWVVTKAE FDFFLGFIKR WWPELLAPFT GGVSEIIAHW DAVVDFVKKL PGRLVSAGAH
MWDWISQKWD DDVAAPVSKA FDGFIHTVTG LPGKLARAGA GMWDWIKEEF VGALNAIANL
WNQLHFSTPS FHIPIPFSSG INVDSITVGV PPIGPFKAAG GPIWGGLSAI IGEAGTELLK
LPTGTQVMPH ANTQSMIAQG GLGSSGGVLQ IEWVGGNGGD ELMTWIRKNI RIRHGSDPNS
VQKALGQSF