Gene Caci_1052 details


Gene Information

Locus tag: Caci_1052
Symbol:
ID: 8332387
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 1193497
End bp: 1197585
Gene Length: 4089 bp
Protein Length: 1362 aa
Translation table: 11
GC content: 69%
IMG OID: 644954200
Product: coagulation factor 5/8 type domain protein
Protein accession: YP_003111819
Protein GI: 256390255
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
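
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1193497, 1197585)
print(length)                  # 4089
print(protein_length(length))  # 1362
```

Under these assumptions both results match the Gene Length and Protein Length fields of the record.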


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.311777
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACAGG AATCGAATCA GCCGTTCCGC GCGTCGCGGC GGACCGTCGT CGCGGCGGCG 
TCGACTTTGT TGGCCGGCTT CGCGGTGGAC ACCGCTTTCC CGAGTGTCGG CTTCGCCGCC
GAGCCGGGGA AGGCTGGAGT CCCGGGTCAT GCGCCGGCGC CGGGGGAGCT GGCGATGTAC
CGGCCGATCG AGGTCTCCTC GACCGATTAC GCGCCCACGC CTGGCGCGTT CGCCGTCGAC
CGGGTGACCT CGGCCGGGGT CAGGGGTACC GGGTGGCGCG CCGCGGCCGG TGATCCGCAG
TGGATCTCCG TGGACTTGCA GGCTGTCTGC CAGGTCACCT CTGTCCATCT GACCTTCGAG
GCGGCCGCGG GCGATCCGGT CTTCGTCAAG CCGACCTCCG GCAACTGGGC CGATGGGACG
ACGGGCAAGG AGCTGCTCTC CAGCTACGCG TCCGCCTTCG TCGTCGAGGT GTCGACGGAC
AAGAAGTCGT GGACGAGCGT GTATCAGACG GCGTCGGGCA CCGGCGGCGC GGTCGTGATC
GACCTGGCGG CTCCGGTCTC GGCGCGCTGG GTGCGGCTGA CGGCGTCCAA GCGCTCGGAT
GCGAATCCCT TGGGACTCAA CGGTTTCCAG GTGTACGGCA GCGCGAGCGG GCACCGGCCC
GCCGCCACCG GGTGGACCGA CTGGGGTACG CACGACAACC ACGCGCCCGG ACTGGTCGTG
GCCGCCGACG GCACCGTGCC GCTGGAGTCC GGCTGGGTGC TGACGATGGA CGACTGGGCT
CCCGGCGACG GGACCGCGCT GTCCGTGCCG ACCGTGGACA CCAGCGGGTG GTTGCCCGCC
ACGGTTCCCG GGACCGTCCT GGCCTCGCTC GTCGAGCAGG GGCATCTGCC CGATCCGGTC
GCGGGCTTCA ACAACCTGCG GATTCCCGAG GCGCTGTCCC GCCATTCCTG GTGGTACAAG
CGCGACTTCG CGCTGCCCGC CGCGCTGGGC GCCGGGCACG GACGCCGGAT CTGGCTGGAG
TTCGACGGAG TCAACCACCA GGCCGCGCTG TTCCTCAACG GAGCGCAGAT CGGCAGCCTC
ACCTACCCGT TCGCGCGCGC CGCGATCGAG GTGACCGAGC ACTTGGTCGC CGGCGAGCAG
TCCCTGGCCG TGAAGATCGA CCCGATGCCG ATTCCCGGCA GCCCGGGCGA CAAGGGACCG
GCCGGTCAGT CCTGGGTCGA TGCCGGCGCG CAGATCATGA ACATGAACTC CCCGACATAC
CTGGCCGCCT CGGGCTGGGA CTGGATGCCC GCGGTCCGCG ACCGGGTCAG CGGTATCTGG
AACCACGTGC GGCTGCGCTC CACCGGGGAC GTCGTGATCG GCGACCCTCG GGTGGACACG
GTGCTGCCGG CGCTGCCCGA CACCACCAGC GCGTCGGTGA CCATCGTCGT TCCCGTGCGC
AACGCCGGGT CCGCCGATGT CAGCGCGACC GTGACAGCGT CCTTCGACAC CGTGCGCCTC
TCGCAGACGG TGACCGTCCC GGCCGGTAAG AACCTGGACG TCACCTTCGC CCCGGCGAAG
TTCGCGCAGC TGAACCTGCG CGATCCGAAG CTGTGGTGGC CCAACGGCTA CGGCGATGCG
AACCTGCACA CCTTGAACCT GGCGGTCACG GTCGCGGGTC AGCGCAGCGA CCAGCGCACC
ACCCGCTTCG GCATCCGGCA GTTCGACTAC GAGTACAAGA CGCCGCTGCC CTTCGTCGCA
TCAGCCGACG CGTACACCCA GACCACGGAC CTCGGTGCGC GGCAGGCGCG CTATGTGCGC
ATCAACTGCC AGACGCGCGC GACCGGATGG GGCTTCTCGC TCTGGACGCT GTCCGTCCTG
AACAGCGCCA CGCCCGGTAC CGACCTGGCA CTGCACCAGC CGACGACCGC ATCGACGCAG
GACCCCTCGA ACCCGGTGAC CAACGCCACC GACGGGGACG CGAACACCCG CTGGTCCTCC
GACTTCGCCG ACGACCAGTG GATCGAGGTG GACCTCGGGG CGTCAGTGTC CTTCGATCAG
GTCGCCATCA CCTGGGAGCA GGCGTACGCG CGCACGTACA CCGTGCAGGT CTCGACGGAC
GGCTCGGTGT GGACGGACGC GAAGAGCGTG GACAACACGG CGATCCCGCT TCCGTTCAAC
GGCGGCGACG CGAGCCTGGA CGTCGAGTCG TTCGCCGCGG CCTCCGGCCG CTACGTGCGG
ATCAGCGGCG GCGTGCGCGA GACCAGCTGG GGTAACTCGC TGTGGTCGCT GTCGGTCCTG
AACAGCGCCA CGCCCGGAGT CGACCTGGCC TTGCACAAGA CGGCCACCGC CTCGACCGAA
GACCCCTCGA ACCCGGCGGC CAACGCCACC GACGGCAACA GCGGCACCCG ATGGTCCTCG
GACTATGCGG ACAACCAATG GATCCAGGTG GATCTCGGCT CGTCGCAGAC CTTTGACGGA
GTCGGCATCC TGTGGGAGCA GGCGTATCCG AAGACCTACG TGATCCAGGT GTCCGACGAC
GGCTCGTCGT GGACCGACGT GAAGACAGTC GGCCTCGCGC CGGAGTCGCT GAAGATCAGC
GTGAACGGTG TCAGAGTCCT GTGCCGGGGC GGCAACTGGG GCTGGGACGA ACTGCTGCGC
CGGATGCCCT CCGACCGGAT GGACGCGGCG ATCCGCATGC ATCGGGACAT GAACTTCACG
ATGGTCCGCA ACTGGGTCGG CGCCAGCAAC CGTGAGGAGT TCTACGCCGC GTGCGACGAG
TTCGGGCTGC TGGTGTGGAA CGACTTCCCG AACGCCTGGG GCATGGACCC GCCGGACCAC
GACGCGTACA ACTCGATCGC CGCGGACACC GTCCTGCGCT ACCGGATCCA CCCGAGCGTC
GTCGTGTGGT GCGGCGCGAA CGAGGGGAAT CCGCCGCAGG CGATCGACGA GGGCATGCGC
AACGCGGTGA CGAACGGCGC GCCCGGCATC CTGTACCAGA GCAACTCCGC CGGCGGGAAC
ATCACCGGCG GCGGTCCCTA CTACTGGGTC GAGCCGGAGA CCTACTACGA CCCGGCGACG
TACGGCAGCC ACAGCTTCGG TTTCCACACC GAGATCGGGA TGCCGGTGGT CTCCACGGCC
GACAGCCTGC GGAACATGGC CGGCGAACAG CCGGCGTGGC CCATCGGCGG TCCGTGGTAC
TACCACGACT GGAGCCAGTA CGGTAACCAG TCGCCGCTGC AGTACCAGGC CGCGATCGAA
GCCCGGCTCC AGACCTCGAA CACCCTCGAG GACTTCGCCC GCAAGGCGCA GTTCGTCAAC
TATGAGAACG CGCGCGCGAT GTTCGAGGCG TGGAACGCGA ACCTGTGGGC CGATGCCAGC
GGTCTGATGC TCTGGATGTC GCACCCGGCG TGGCACAGCA CGGTGTGGCA GACCTACGAC
TATGACTTCG ACGTCAACGG CATGTACTAC GGCGCGCGCA AGGCGTGCGA GCCCGTGCAC
GTGCAAGCCG ATCCGGTCCA CTGGCAGGTC GTCGCGGTGA ACCACACGCC GCACGCGGTG
AGCGGCGCGA CGGTCTCGGC GCGGCTGTTC GACCTGTCGG GCAGGCAGCT CGGCACGACG
CAGAGCGCTG CGATCAACGT CGCGGTGGCG GACAGTGCGA AGGCCTTCGC GGTGGCATGG
ACCGACGCGC TTCCCGATCT GCACCTGTTG CGCCTTACGC TCCAGGACGC CTCGGGCAAG
ACACTGTCGG AGAACACGTA CTGGCGGTAC CGCGCTCCGT CGGCGATGCA GGCGCTGAAC
AAGGCGCAGC AGACCCGGAT CACGGCCTCG ATCACCGGGG CGACCAGCGG CGGTGACGGG
CGCCGCCAGC TGACGGCGAC GGTTCGCAAC CAGGGCTCGA GCGTCGCGGC GATGGTGCGG
CTATCACTGC AGAACCGCGC TTCGGGGCAG CGGGTCTTGC CCACGCTCTA CGGAGAGAAC
TACGTGTGGC TGCTCCCCGG CGAGACGCGC ACGATCATCG TGTCGTTCCC GTCCAGCGCG
CTGCCCAAGG AGCAGCCGGA GCTGCACGTC GAGGGCTACA ACACGAGCGC GGTCATCGCT
CGCGCATAG
 
Protein sequence
MAQESNQPFR ASRRTVVAAA STLLAGFAVD TAFPSVGFAA EPGKAGVPGH APAPGELAMY 
RPIEVSSTDY APTPGAFAVD RVTSAGVRGT GWRAAAGDPQ WISVDLQAVC QVTSVHLTFE
AAAGDPVFVK PTSGNWADGT TGKELLSSYA SAFVVEVSTD KKSWTSVYQT ASGTGGAVVI
DLAAPVSARW VRLTASKRSD ANPLGLNGFQ VYGSASGHRP AATGWTDWGT HDNHAPGLVV
AADGTVPLES GWVLTMDDWA PGDGTALSVP TVDTSGWLPA TVPGTVLASL VEQGHLPDPV
AGFNNLRIPE ALSRHSWWYK RDFALPAALG AGHGRRIWLE FDGVNHQAAL FLNGAQIGSL
TYPFARAAIE VTEHLVAGEQ SLAVKIDPMP IPGSPGDKGP AGQSWVDAGA QIMNMNSPTY
LAASGWDWMP AVRDRVSGIW NHVRLRSTGD VVIGDPRVDT VLPALPDTTS ASVTIVVPVR
NAGSADVSAT VTASFDTVRL SQTVTVPAGK NLDVTFAPAK FAQLNLRDPK LWWPNGYGDA
NLHTLNLAVT VAGQRSDQRT TRFGIRQFDY EYKTPLPFVA SADAYTQTTD LGARQARYVR
INCQTRATGW GFSLWTLSVL NSATPGTDLA LHQPTTASTQ DPSNPVTNAT DGDANTRWSS
DFADDQWIEV DLGASVSFDQ VAITWEQAYA RTYTVQVSTD GSVWTDAKSV DNTAIPLPFN
GGDASLDVES FAAASGRYVR ISGGVRETSW GNSLWSLSVL NSATPGVDLA LHKTATASTE
DPSNPAANAT DGNSGTRWSS DYADNQWIQV DLGSSQTFDG VGILWEQAYP KTYVIQVSDD
GSSWTDVKTV GLAPESLKIS VNGVRVLCRG GNWGWDELLR RMPSDRMDAA IRMHRDMNFT
MVRNWVGASN REEFYAACDE FGLLVWNDFP NAWGMDPPDH DAYNSIAADT VLRYRIHPSV
VVWCGANEGN PPQAIDEGMR NAVTNGAPGI LYQSNSAGGN ITGGGPYYWV EPETYYDPAT
YGSHSFGFHT EIGMPVVSTA DSLRNMAGEQ PAWPIGGPWY YHDWSQYGNQ SPLQYQAAIE
ARLQTSNTLE DFARKAQFVN YENARAMFEA WNANLWADAS GLMLWMSHPA WHSTVWQTYD
YDFDVNGMYY GARKACEPVH VQADPVHWQV VAVNHTPHAV SGATVSARLF DLSGRQLGTT
QSAAINVAVA DSAKAFAVAW TDALPDLHLL RLTLQDASGK TLSENTYWRY RAPSAMQALN
KAQQTRITAS ITGATSGGDG RRQLTATVRN QGSSVAAMVR LSLQNRASGQ RVLPTLYGEN
YVWLLPGETR TIIVSFPSSA LPKEQPELHV EGYNTSAVIA RA
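
Both sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned up and a GC percentage computed. The helper names are hypothetical, and the sample is only the first line of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence above, used only as a small sample.
sample = "ATGGCACAGG AATCGAATCA GCCGTTCCGC GCGTCGCGGC GGACCGTCGT CGCGGCGGCG"
dna = clean_sequence(sample)
print(len(dna))               # 60
print(dna.startswith("ATG"))  # True
print(round(gc_percent(dna), 1))
```

Applying `gc_percent` to the full cleaned gene sequence gives a value that can be compared with the GC content field of the record.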