Gene Caci_2854 details


Gene Information

Locus tag: Caci_2854
Symbol:
ID: 8334203
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 3257843
End bp: 3261151
Gene Length: 3309 bp
Protein Length: 1102 aa
Translation table: 11
GC content: 66%
IMG OID: 644955998
Product: portal protein
Protein accession: YP_003113604
Protein GI: 256392040
COG category: [S] Function unknown
COG ID: [COG4695] Phage-related protein
TIGRFAM ID: [TIGR01537] phage portal protein, HK97 family
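
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming inclusive start and end positions and a final stop codon that is not counted in the protein (the variable names are illustrative, not part of the record):

```python
# Values copied from the record above.
start_bp = 3257843
end_bp = 3261151

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under those assumptions the result agrees with the listed Gene Length (3309 bp) and Protein Length (1102 aa).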


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0279665
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.00446406
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGTCCGGT CGATGTTCAA CGTCGTCGGC AGCCTACTCA ACAAGAACGC CAACGCCTCA 
CCGGTCCCCT ACGCCGCACC AGGCCGCTAC ATCATCCCCG CACTGTCCGG ACGCTCGGAC
AACGAGGTCT ACATGCGGGC GATGGGGACC GCGGGCACGA TCTTCCAAAT CGTGTCGCTG
CTGTCCTCCG CCAGCGCCAC CCCACAGTGG CGGCTGTACC GCAAGCCGAA AGCCGACGGC
AGACAGCGGT ACACCACCGG CGACCGCGGA TCCGACCAGC GCACCGAAGT GCTGCAGCAC
CAGGCGCTGA ACGTCTGGAA CAACCCGAAC CCGTTCACCA CCGGCGTCAA CTTCCGCGAA
GCCGGCTGGC AGCACATGGA GCTGACCGGC GAACAGTGGT GGGTTGTCGT CCGCGACTCC
CGAGCCACCT TCCCGACCGG GCTGTGGCTG GTCCGCCCGG ACCGCATGGA GCCGGTGCCG
TCGGCGGAGA AGTACATCGC CGGGTACGTC TACACCGGAC CGTCGGGCGA GCGGGTTCCG
CTGCAGCCCG AAGAGGTCAT CCTCACCAAA TACCCGAACC CCCTGGACCC ATACCGCGGG
CTCGGGCCGG TGCAGTCGGT GCTGGTCGAC ATCGACGCCA TGAAGTACGG CAGCGAGTGG
AACCGGAACT TCTTCATCAA CGGCGCTGTC CCCGGCGGCG TCGTCACTGT GCCGGGAAAC
ATGTCGGACG ACGAGTTCGA CCAGTTTTCG ACCCGGTGGC GCGAATCTCA CCAGGGCGTG
TCCCGGTCCC ACCGCGTTGC GATCCTCGAG GGTGGCGCGA CCTGGGTTCC GACCCAGATG
TCCATCAAGG ACATGGACTT CTCCAACCTG CGGAACGTCA GCAGGGATGT CCTGCGCGAG
GCCTGGGGCA TCCACAAGTC GATGCTCGGC AACGCTGACG ATGTCAACCG CTGCCACGAC
GACCAGACCG AGGTCCTCAC CAACGGCGGC TGGAAGCCGT TCGAGAAGGT CGAAGACGCC
GACCTCATCG CCACCGTTAA CCCCGCGACG CGCCGCATTG AATACCACGC GCCGATCCAC
CGCTTCGCCT ACGACTACCA CGGCGACATG GTTCGGATCG TCAACGACGT CATCGACGTG
AAGGTCACCC CCAACCACAC GATGCTGTAC GCGACACCGC GGTTGCCGGA CGTTTGGCGT
ACATGCCGCG CGGACCAACT CCCGTCCCGG TTCCTAGTTC AGACTGCCCC GGACGCCGAG
GACTGCGCGG ACCAGCAGTG GTTTGACCTG CCCGCCGTCG AATACGACAA CGGGCACTAC
GGTCGAGGCG GATCTGAACG CCTGCCGATG GATGCGTGGC TGGAGTTCCT GGGCTGGGTT
ATCTCCGAGG GCGGCATCCT GTCCGAGGAG CGTGCTGGCA ACCGCTACGT AATGACGCTG
GCGCAGAAGA AGTACCCGCA GCGGATCCGT GACTGTCTCG CGCAGCTTCC GTTCTCTGCC
TATGAGTACT TCGACGAGGG CGGTCAGATC GCCCGGTGGA ACATCACCGG CAAGGGTCTG
ATCACCTGGC TGCGGGAGCA CGTAGGCACC AACTGCCAGG ACAAGCGCAT CCCGCGGTGG
TGCTTCAACC TGTCGCTCCG GCAGCGGCGC ATCCTGTTCG ACGCGATGAT GGCTGGCGAC
GGCAGTCGCG ATCCACGGCC GGGCCGCTCG AACTGCTACT ACGCGACTAG CTCCCCGGGC
CTTGCCGATG ACGTGCTGGA GTTCGTCTTC CGTCTCGGCT GCCGCGCCAA CATGGCGTCA
CACCATGACG GACGCAGCGC TCGCGCGCCG ATGTACTACA TCCACATCAC CGACCGGCCG
GCGTCCTGGG TGACGGGATC CAACAACGTT TCGACCGAAC GTTACGACGG CAAGGTCTAC
AGCCTTGAGG TGCCGAACCA CATCTACGTG ACCCGGCGCA ACGGGCGCAT CGCGATACAC
GGCAACTCCA ACGCCCAAAC CGCCGAAGAA GTCTTCGGAC GCTGGAAGAT CATCCCCCGG
CTTGACCGGC TCCGCGACAC TCTCAACAAC CAGTTCCTGC CGATGTTCGG CAGCACCGGC
GAGAACGTCG AATGGGACTA CGTCAACCCG CTGCCGGACG ACCGCGAAGC CGACAACGCC
GAGCTCACCG CGAAGACCAC GGCCTACGCC GCGCTCATCA GCGCAAACGT CAACCCGGAC
GACGCTGCCG AAGTGGTCGG CCTGCCGCGG ATGCGCCACA TCATTCAGGT CCCGGCCGCA
ACCCCCACTA CCGGCGCCGG CGAACTTGAG CAGGCCGACG ACAGCAGCGG CGAGCCTACC
GAGGACGACA ATCCCGTCGA GAACCAGGTG CGCAGTTCCA TCCACGCCCG CACCCTCACC
GCGATTACCG CAGCCACAGG CGATTCGGGG CCGACGCCGC CAGCGCAGGG CCCGAACCAG
CCGGACCTGT CCCGCGTCGA TGCACAGTGG AAGTCGGCCA CCGACAATGT CGTCGCCGAC
TACCAGAGCC AGATCCTGCC CGCCCAGCAG CAGCAGTTGC AGCAGCAGAT CCGCGAACAC
GTGGACGCTG CCGAACTAGC CGCGCTCGGA ACGCTGGCCG TCGACACGGT CGCCGCGAAG
GCGTTGCTGC TTGCCGCGAT GGTCGCGTTC GCCGTGGTTG CCGCTCGGGA AGCCAGCCGC
GAAGCCGACG AGCAAGGCGC CAGCGTGCCG CCGAAGACGC CGGACTCCTC CGAGCTCGAA
ACGATCGCCG CAGTAGCTGT CGCCCTTATG GCTTCCGAGC TTGCGGTATC GGCCGGGCGA
GAAGCGATGC GGCTTGCCGT TCCCGGAGTT GGCGGGCAGC AGGTTGCGGA CCAGGTTGGG
CAGTTCCTGC AAGGCCTGTC CGAAGCCGGC CCCCGCGGGC ACCTGGCCGG AGCGATGTCG
GCGGCACAGA ACCGGGCCCG GCACGCCACA TTCACGCAGC ACGGCGCCCC GCGTGGGCAG
CTCCGGGCCG TAGAAGTCTT GGACTCGGCA ACCTGCGACA ACTGCGAAGA GATCGACGGG
ACCGTCTTCG GCGACACCGA TGACCCGGAC GCCGTGGCCG AGGCGTTCGC CGCGTACCCC
ATGGGTGGCT ACACGCTTTG CGAGGGGCGC GAACGTTGCC GCGGAACCCT TTTCGTCGAC
TACGACGTCG GGCCTGACTC CGATAACCCC GGTGACGACC TGACTGCAAT GTTGAAGAAG
CTCGTCAACC TTCTCGAGCC GGAACCGGCG CACATTAACG GACACCACCA CGAGAAGGCG
AGGACGTGA
 
Protein sequence
MVRSMFNVVG SLLNKNANAS PVPYAAPGRY IIPALSGRSD NEVYMRAMGT AGTIFQIVSL 
LSSASATPQW RLYRKPKADG RQRYTTGDRG SDQRTEVLQH QALNVWNNPN PFTTGVNFRE
AGWQHMELTG EQWWVVVRDS RATFPTGLWL VRPDRMEPVP SAEKYIAGYV YTGPSGERVP
LQPEEVILTK YPNPLDPYRG LGPVQSVLVD IDAMKYGSEW NRNFFINGAV PGGVVTVPGN
MSDDEFDQFS TRWRESHQGV SRSHRVAILE GGATWVPTQM SIKDMDFSNL RNVSRDVLRE
AWGIHKSMLG NADDVNRCHD DQTEVLTNGG WKPFEKVEDA DLIATVNPAT RRIEYHAPIH
RFAYDYHGDM VRIVNDVIDV KVTPNHTMLY ATPRLPDVWR TCRADQLPSR FLVQTAPDAE
DCADQQWFDL PAVEYDNGHY GRGGSERLPM DAWLEFLGWV ISEGGILSEE RAGNRYVMTL
AQKKYPQRIR DCLAQLPFSA YEYFDEGGQI ARWNITGKGL ITWLREHVGT NCQDKRIPRW
CFNLSLRQRR ILFDAMMAGD GSRDPRPGRS NCYYATSSPG LADDVLEFVF RLGCRANMAS
HHDGRSARAP MYYIHITDRP ASWVTGSNNV STERYDGKVY SLEVPNHIYV TRRNGRIAIH
GNSNAQTAEE VFGRWKIIPR LDRLRDTLNN QFLPMFGSTG ENVEWDYVNP LPDDREADNA
ELTAKTTAYA ALISANVNPD DAAEVVGLPR MRHIIQVPAA TPTTGAGELE QADDSSGEPT
EDDNPVENQV RSSIHARTLT AITAATGDSG PTPPAQGPNQ PDLSRVDAQW KSATDNVVAD
YQSQILPAQQ QQLQQQIREH VDAAELAALG TLAVDTVAAK ALLLAAMVAF AVVAAREASR
EADEQGASVP PKTPDSSELE TIAAVAVALM ASELAVSAGR EAMRLAVPGV GGQQVADQVG
QFLQGLSEAG PRGHLAGAMS AAQNRARHAT FTQHGAPRGQ LRAVEVLDSA TCDNCEEIDG
TVFGDTDDPD AVAEAFAAYP MGGYTLCEGR ERCRGTLFVD YDVGPDSDNP GDDLTAMLKK
LVNLLEPEPA HINGHHHEKA RT
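
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, which uses the standard codon assignments but accepts alternative start codons such as GTG that are read as methionine at the initiation position. The sketch below illustrates this on short excerpts of the sequence above. The helper names (`clean_sequence`, `translate`) and the `has_start` flag are hypothetical, and the code assumes the input starts in frame.

```python
# Standard codon assignments in compact TCAG order; table 11 shares these
# and differs from the standard table only in which codons may initiate.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks of the 10-base block layout."""
    return "".join(text.split()).upper()


def translate(dna, has_start=True):
    """Translate in frame until the first stop codon.

    Assumption: when has_start is True, the first codon is a valid
    initiator and is therefore emitted as M regardless of its usual meaning.
    """
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append("M" if has_start and pos == 0 else amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence listed above.
head = translate("GTGGTCCGGT CGATGTTCAA CGTCGTCGGC")
tail = translate("AGGACGTGA", has_start=False)
print(head, tail)
```

The two excerpts translate to the first ten and the last two residues of the protein sequence listed above (MVRSMFNVVG and RT), with the final TGA acting as the stop codon.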