Gene Ccur_02810 details

Gene Information

Locus tag: Ccur_02810
Symbol:
ID: 8374489
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cryptobacterium curtum DSM 15641
Kingdom: Bacteria
Replicon accession: NC_013170
Strand:
Start bp: 331349
End bp: 332629
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 61%
IMG OID: 644993205
Product: phage terminase, large subunit, PBSX family
Protein accession: YP_003150690
Protein GI: 256826731
COG category: [R] General function prediction only
COG ID: [COG1783] Phage terminase large subunit
TIGRFAM ID: [TIGR01547] phage terminase, large subunit, PBSX family
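
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the reported gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(331349, 332629)
print(length, protein_length_aa(length))  # 1281 426
```

Under those assumptions, 332629 - 331349 + 1 gives 1281 bp, and 1281 / 3 - 1 gives 426 aa, matching the record.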


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.965454
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 3.50115e-26
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATTAACG CTGCGCGCCT CATCATCTCC CACTTTCACA GCATCCTCGC CGACGTCTTC 
GCCGAGTGCG GTCATCACGA GTACTGGCTC GAGGGAGGGC GCGGCTCTAC CAAGTCGAGC
TTTATCAGCC TTGTAATCGT CCTTCTAGTA GCTACCTTCC CCTGGGTTAA TGCCGTTGTC
TTCCGCCGCC AGGCTAATAC GCTACGGGAT ACAGTCTACG GGCAGTTCCT CTGGGCGATA
GGCGCCCTCG GGCTTGATGG CTGCTTCTAC ACATCCAAGT CCCCGCTGGA GATCGTCTAC
CTGCGGACAG GGCAGAAGAT CATATTCCGC GGGCTAGACG ACCCGAAGAA GCGGAAGGGC
GCGGTGTTCC CTGTCGGCTA CTGTGCCGTC CAGTGGTTTG AGGAGCTCGA CGAGTTCAAC
GGCTGGGATG ACATCTCATC GACGCTCCGC ACGTATCGCC GTGGCGGGTC GAGGTTTTGG
ACGTTCTACA GCTATAACCC TCCGCGGTCT CTGTGGTCGT GGGTAAACAA GAAGGCTCTG
GAGATGCAGC GCAAGCGGGG GTGTGTGGTA GACCACAGCA CCTATCTCGA CGTGATCGAC
GGCGGTCATA GTGACTGGCT CGGCGAGAAA TTTATAGAGG ATGCCGACTA CGAGAAAGAG
GAGCACCCGA CGGGCTACCG CTGGGAGTTC CTCGGCGAGA TCACGGGGAC GGGCGGCAGC
GTCTTTGAGA ATGTCGTACA GGTGAGCCTA AGCGACAAGG AGGTAGAGAG CTTCGATAAC
CTCCGCTGCG GCGTGGACTG GGGCTGGTTC CCCGATCCCT GGCGCTTCGT CATGTGCGAG
TGGCAGCCCG CACGGCGTCG CCTGGTGCTC TTCCGCGAGC TATCGGCTAA CCGTACCACG
CCGCAGGATA CGGGGGCGAT GGTACGCGAG GCGCTGACGT ATCGGGATGC GCGGCACAAG
GAGCCGACGT ACCACCGCGA CGCGGTCTGG TGCGATAGTG CCGAGCCGTC GAGTATCGAT
ATATACCGCC GGCAGTGCGG ACTGAATGCC CGGGCGGCAG ACAAGGGCGG GATGAGACGC
GTGAGCTACC AGTGGCTTGA GGGACTACGG GAGATCGCTA TCGACCCCGA GCGATGCCCG
AGAGCCTGGG AGGAGTTCAC CCTGTGTGAG TACGCCAAGG ATCGCGCGGG GAGATGGCTC
GATGACTACA ACGACGGTAA TGACCACAGT ATCGACGCAG TGCGCTACGC GATGATGCGC
GAGTGCGTGA GAGGAGCATA G
 
Protein sequence
MINAARLIIS HFHSILADVF AECGHHEYWL EGGRGSTKSS FISLVIVLLV ATFPWVNAVV 
FRRQANTLRD TVYGQFLWAI GALGLDGCFY TSKSPLEIVY LRTGQKIIFR GLDDPKKRKG
AVFPVGYCAV QWFEELDEFN GWDDISSTLR TYRRGGSRFW TFYSYNPPRS LWSWVNKKAL
EMQRKRGCVV DHSTYLDVID GGHSDWLGEK FIEDADYEKE EHPTGYRWEF LGEITGTGGS
VFENVVQVSL SDKEVESFDN LRCGVDWGWF PDPWRFVMCE WQPARRRLVL FRELSANRTT
PQDTGAMVRE ALTYRDARHK EPTYHRDAVW CDSAEPSSID IYRRQCGLNA RAADKGGMRR
VSYQWLEGLR EIAIDPERCP RAWEEFTLCE YAKDRAGRWL DDYNDGNDHS IDAVRYAMMR
ECVRGA
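
As a rough consistency check, the gene sequence can be translated and compared against the protein sequence, and its GC content recomputed. The following is a minimal sketch, not the database's own method. It assumes the sequence is pasted as text in 10-base blocks (as shown above), and it uses the standard codon assignments, which translation table 11 shares for all internal codons; table 11 differs only in its set of permitted start codons, and this gene starts with ATG. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI order (first base varies slowest, bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    bases = clean(seq)
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGATTAACG CTGCGCGCCT CATCATCTCC"))  # MINAARLIIS
```

Passing the full gene sequence to `translate` and `gc_percent` should allow a direct comparison with the protein sequence and the GC content reported in the record.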