Gene Caul_3486 details


Gene Information

Locus tag: Caul_3486
Symbol:
ID: 5900941
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3764291
End bp: 3765475
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 57%
IMG OID: 641563992
Product: HK97 family phage major capsid protein
Protein accession: YP_001685111
Protein GI: 167647448
COG category: [R] General function prediction only
COG ID: [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
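
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3764291
end_bp = 3765475

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 1185
print(gene_length % 3)  # 0, i.e. a whole number of codons
print(protein_length)   # 394
```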


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.168874
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTATTG AAAAACTAGA TGGTCTGATC GCAGAAATGC GCCTGAAGGC CGATGCAACT 
GACGTAGTTG CAAAGGGTCA ACTGGACGAC CTAAGCGCTC ACGTTAAGTC GGAAATCGCG
GCAATCCGCT CCGACATGGC TGCACAAAAT GCACCACGCG AAGACGCACA AGACACCGCT
GAAATCCACC AAAAGGCCGT TGCCGAATTC ATGGTCCACG GCTCCAAGAT GGTCGAAGCC
GGTAAGATCA AGGCCACTGA CTACGCAGTC AGCGTCTCGG CCGATGGCGG CATCACCGTC
CCAACCAACA TCCACACCGA CGTTGTCGAA AAGCTCCGTA AGTCGAGCCC ACTGCTCGGC
CTATCGACCG TGACCTCGCT GAAGGGTCTG CAAAAGCTCC TGCTGAAGAC CAGCGCTGCT
CAAGCTAACA CCCGTTCAGA GCGCGGCGCG TTCACCGACA ACACCGTTGA AGGCTTCGCA
GGTATCACTC TGGGCTCGAA GGACGTTTAC GACCGTCAAG CCCACACTGT TGAATCGGTC
GAAGGCGACT CCGTTCTGGA CTTCCAGCAA GTCCTACTGG CTTCGATGAA TGCCGGTATC
GCTGAGAAGA TGGCGAGCGA ACTGCTGAAC AGCACCGTGA CCAACGCCGT TGAAAACGTC
GCAGGCACAA CCACCATCCC AGCCGGTATC CTGAACCGCG TGACCGAAGT TGCGGCTGAC
CGCTTCACCG GTAGCATCGG CAAGACCCCT TGCCTGACCA CTGCCGGTAT CAACCTCGTC
ACTTTCGAGG ACATGATTAA GCTCTACAAC AGCCTGCACA CCAAGTACCG CAACTCGGCG
ACCTTCGTCA CCGGCAGCGA TATTGAGCTG CAACTGATGC TCGTGAAGGA CTCGACCGGC
AACTACATCT GGCAACCAGC CGTGGGTCAG ACCTACCAAG CTGTTATCCA GGGCCGTCCT
GTCGTCATCG ACGACTTCAT GCCCGGCGTG ACCGGTTCGG CAGGCGTTCC TCGCGTTCTG
TTCGCCGACT TCTCGGAAAA CTACGTCAAC TTCTCGGGTC CAATGCAGTG GGTCGTCGAC
CCATACACCG CACCTGGCTA CGTGAAGTAC ACCGCGCGTC AGCGTTTCGG TTCGGTCTTC
CGCGACAGCC AAGCAATCCG TGGTCTGCTC CTAAAGACCT CGTAA
 
Protein sequence
MSIEKLDGLI AEMRLKADAT DVVAKGQLDD LSAHVKSEIA AIRSDMAAQN APREDAQDTA 
EIHQKAVAEF MVHGSKMVEA GKIKATDYAV SVSADGGITV PTNIHTDVVE KLRKSSPLLG
LSTVTSLKGL QKLLLKTSAA QANTRSERGA FTDNTVEGFA GITLGSKDVY DRQAHTVESV
EGDSVLDFQQ VLLASMNAGI AEKMASELLN STVTNAVENV AGTTTIPAGI LNRVTEVAAD
RFTGSIGKTP CLTTAGINLV TFEDMIKLYN SLHTKYRNSA TFVTGSDIEL QLMLVKDSTG
NYIWQPAVGQ TYQAVIQGRP VVIDDFMPGV TGSAGVPRVL FADFSENYVN FSGPMQWVVD
PYTAPGYVKY TARQRFGSVF RDSQAIRGLL LKTS
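
The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a short script. This is a minimal sketch, not the database's own method: `clean`, `translate`, and `gc_content` are hypothetical helper names, and the codon table is the standard one, on the understanding that NCBI translation table 11 uses the same codon-to-amino-acid assignments and differs only in which start codons it permits.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
# Assumption: table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon.

    Raises KeyError on any codon containing a character other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna):
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGTCTATTG AAAAACTAGA TGGTCTGATC"))  # MSIEKLDGLI
```

Passing the full gene sequence to `translate` and `gc_content` allows the result to be compared with the protein sequence and the GC content field; the spaces between the ten-base groups are removed by `clean`.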