Gene Caci_4312 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4312 
Symbol 
ID8335666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4892640 
End bp4895663 
Gene Length3024 bp 
Protein Length1007 aa 
Translation table11 
GC content67% 
IMG OID644957415 
Productcoagulation factor 5/8 type domain protein 
Protein accessionYP_003115017 
Protein GI256393453 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.382948 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.000989115 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATTCCCG CACCAAGATT TTCACGGAGA TCCAGATCAT CGGCACTGTT GCTGGTCCTG 
GCCATGTTGA TGGCCGTTCT GTCGGTACCG TCGCTCGGGG CGGGGCGGGC CAAGGCGGCC
AGCTGTGACA CCGGCACGAA CCTGGCACTG AACAAGAACG CCACCGCTTC CTCGATCGAG
GGCGCCGGAA CCCCGGCCAC GGCTGCGGTG GACGGCAACA CCGGCACGCG CTGGTCGTCG
CAGTTCTCTG ACCCGCAGTG GCTCCAAGTG GATCTCGGTA GCTCGCAGAG CATCTGCCAG
GTGACGCTGA ACTGGGAGAC CGCCTCGGGC AAGGCGTTCC AGATCCAGAC CTCCAACGAT
GCCGCTACCT GGACCTCGAT CTACTCCACG ACCACCAGTG CTGGCGGGAC GCAGACGCTC
AGCGTGTCCG GCACCGGTCG TTACATCCGG ATGTACGGCA CCGCGCGTAA CACCGGTTAC
GGCTACTCGC TGTGGGAGTT CGGCGTCTAC GGCAGCTCCA ACGGCGGAGG CGGCACCCCG
CCGCCGCCGA ACTGGAACCT GGTCTGGCAG GACACCTTCG GCGGCAACTC CGGAACCGCT
CCGTCCTCCG CCAACTGGAT CGAGGACACC GGCCACAACT CCGCGGGCGG ACCCGCGGAC
TGGGGTACCG GTGAGGTCGA ATCCGCCTCC TCGTCGACTG CGAACGTTTC CGTGGACGGC
AACGGCCACC TGAACATCAC TGCCCTCAAG GACGGTGCGG GGAACTGGAC CTCCGGCCGC
ATCGAGACTC AGCGCTCTGA CTTCGCCGCC CCGGCCGGCG GCATGCTGGA GGTCACCGCG
ACCATCAAGC AGCCGAACGT GGCCAACCCC GCCGGCTACT GGCCCGCTTT CTGGGCGCTG
GGCAACGGGT CGCGCACCGG CTCGGGTACC TGGCCGGCCA TCGGCGAGAC CGACATCATG
GAGAACGTCA ACGGTCACCA GACCACCTCC TCCGGTCTGC ACTGCGGGAC CGCGCCGAAC
GGTCCGTGCA ACGAGTCCAG CGGGCGTGGC AGCGGTCTGC GAACCTGCGC CGGCTGCCTG
AGCGCGTACC ACACCTACGC CGAGGTCATC GACCGGACGC AGTCCGACGA GCAGATCCGC
TTCCTGGTGG ACGGCCAGGT CACCTGGACG GTCAGCGAGA GCCAGGTCGG TGTGACCACC
TGGCAGAACG CGGTGGACCA CGGCTTCTTC CTGATCCTGG ACCTGGCGGT CGGCGGCTCC
TGGCCGAACG CCGACTGCGG CTGCACGTCC CCGACCGCGG CGACCACCTC CGGCGGCACG
CTGAGCGTCG GGCCGATCGC CGTCTACAGC ACCACCGGCT CGGCGCCGAC CCCGTTGCAG
CCCCCGGCAC CGGCCACCGG GTCCAGCACC GTGAAGGTCA CCGGCAGCCA GGGGAACTGG
CAGCTGAATG TGAACGGCGC GCCGTACCAG GTCAAGGGTG TGACCTGGGG TCCTGGCAAC
CAGGCCGGGG ACGGCTACCT CGCCGACGCC GCGTCCATGG GGGTCAACAC CATCCGGACC
TGGGGTACGG ATGCGTCCTC CCAACCGCTG CTCGACGCGG CGGCGGCGCG CGGGATCAAG
GTCATCAACG GCTTCTGGCT GAATCAGGGA GCTGACTACG TCAACGACAC CGCGTACAAG
ACCAACACCC TGAACAGCAT CAAACAGTTC GTCACGCAGT ACAAGAGCCA TCCGGCGACC
CTGATGTGGG ACGTCGGGAA CGAGGTGATC CTCACCTCGC AGAACTACAC CTACCCCAAC
GGCGCCACCG TGGAGCAGGA GCGCGTCGCC TACGCGCAGT ACGTCGAGCA GATCACCCAG
GCGATCCACG CCATCGACCC GAACCACCCG GTCACCTCGA CCGACGCCTG GACCGGCGCC
TGGCCGTACT ACAAGCAGTA CACGCCGAGC CTGGACCTGC TCGCCGTGAA CTCCTACGGC
TCGGTGTGCA ACGTGAACAC CGACTGGGTC AACGGCGGCT ACACCAAGCC CTACATCGTC
ACCGAGGCCG GCGACGCCGG CGAGTGGGAG GTCCCCAACG ACGCCAACGG CGTGCCCACC
GAGCCCACCG ACCAGCAGCA GCGCGACGGC TACACCAGCG CCTGGAACTG CATCGCGGGC
CACCCCGGCA TCTCCTTCGG CGGGACCCTG TTCAACTACG GCGTCGAGAA CGACTTCGGC
GGCGTCTGGT TCAACCTGCT CACCGGCGGC TGGCGCCGGC TGTCCTACTA CGCGGTCAAG
CAGGCCTTCA CCGGCCAGGC GCAGACCAAC ACGCCGCCGG CGATCACCTC GATGACGCTC
AGCAACACCG CGAGCGTCCC GGCGGGCGGG CAGTTCACCG TGAACGTCGC CTCGACCAAC
CCGACCGGCG ACGCGCTGAG CTACAACGTC GCCCTGTCGA GCAAGTACGT CAACAGCGCC
ACCCCGTTGC AGTCGCCGTC CAGCTACACC CAGACCGGTC CCGGCGCCTT CACCGTCACC
GCACCCCAGA CGCTCGGCGT GTGGAAGGTG TACGTGTACG TCTACGACCA ACACGGCGGC
GTGGGCATCC AGTCCGTGTC CTTCCGCGTG GTCGCACCCC CGGTCTCCGG CACCAACGTG
GCACTGGGCA AGGCCGTCAC GGCCTCGTCC TTCCAACCCG CCAGCAACGG CCAGACCTTC
GTCCCCGCCA ACGTCACCGA CAACAACTGG ACCACCCGCT GGGCCAGCGA CTGGAGCGAC
CCCCAGTGGA TCCAGGTCGA CCTCGGCCAG TCCACCGCCA TCAAGCACAT CCAACTCGGA
TGGGAGTCCG CCTACGCCAA GGCGTACCAG ATCCAGGTAT CCAACGACGG CACCAACTGG
ACCACCGTCC ACACCACCAC AACCGGCGCC GGCGGAGTCG AGACCTTCGA CGTCACCGGC
ACCGGCCGCT ACGTCCGGAT GTACGGCACG CAGCGGGGGA CGGCGTACGG CTACTCGCTT
TATGAGTTCG GGATCTATGC CTGA
 
Protein sequence
MIPAPRFSRR SRSSALLLVL AMLMAVLSVP SLGAGRAKAA SCDTGTNLAL NKNATASSIE 
GAGTPATAAV DGNTGTRWSS QFSDPQWLQV DLGSSQSICQ VTLNWETASG KAFQIQTSND
AATWTSIYST TTSAGGTQTL SVSGTGRYIR MYGTARNTGY GYSLWEFGVY GSSNGGGGTP
PPPNWNLVWQ DTFGGNSGTA PSSANWIEDT GHNSAGGPAD WGTGEVESAS SSTANVSVDG
NGHLNITALK DGAGNWTSGR IETQRSDFAA PAGGMLEVTA TIKQPNVANP AGYWPAFWAL
GNGSRTGSGT WPAIGETDIM ENVNGHQTTS SGLHCGTAPN GPCNESSGRG SGLRTCAGCL
SAYHTYAEVI DRTQSDEQIR FLVDGQVTWT VSESQVGVTT WQNAVDHGFF LILDLAVGGS
WPNADCGCTS PTAATTSGGT LSVGPIAVYS TTGSAPTPLQ PPAPATGSST VKVTGSQGNW
QLNVNGAPYQ VKGVTWGPGN QAGDGYLADA ASMGVNTIRT WGTDASSQPL LDAAAARGIK
VINGFWLNQG ADYVNDTAYK TNTLNSIKQF VTQYKSHPAT LMWDVGNEVI LTSQNYTYPN
GATVEQERVA YAQYVEQITQ AIHAIDPNHP VTSTDAWTGA WPYYKQYTPS LDLLAVNSYG
SVCNVNTDWV NGGYTKPYIV TEAGDAGEWE VPNDANGVPT EPTDQQQRDG YTSAWNCIAG
HPGISFGGTL FNYGVENDFG GVWFNLLTGG WRRLSYYAVK QAFTGQAQTN TPPAITSMTL
SNTASVPAGG QFTVNVASTN PTGDALSYNV ALSSKYVNSA TPLQSPSSYT QTGPGAFTVT
APQTLGVWKV YVYVYDQHGG VGIQSVSFRV VAPPVSGTNV ALGKAVTASS FQPASNGQTF
VPANVTDNNW TTRWASDWSD PQWIQVDLGQ STAIKHIQLG WESAYAKAYQ IQVSNDGTNW
TTVHTTTTGA GGVETFDVTG TGRYVRMYGT QRGTAYGYSL YEFGIYA