Gene SeD_A2271 details

Gene Information

Locus tag: SeD_A2271
Symbol:
ID: 6873019
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 2165627
End bp: 2167360
Gene Length: 1734 bp
Protein Length: 577 aa
Translation table: 11
GC content: 51%
IMG OID: 642785369
Product: putative phage terminase, large subunit
Protein accession: YP_002216031
Protein GI: 198243032
COG category: [R] General function prediction only
COG ID: [COG4626] Phage terminase-like protein, large subunit
TIGRFAM ID:
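
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2165627, 2167360)
print(length)                     # 1734
print(protein_length_aa(length))  # 577
```

Both values agree with the Gene Length and Protein Length fields of this record.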


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.441488
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.000000000000278373
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCCGGA AATCATATCC CAACGTTAAC GCCGCTAATC AGTATGCCCG CAACGTTGTG 
CGGGGAAAAA TCACGGCATG TCAGTATGTC ATTCAGGCCT GCCAGCGTCA CATTGATGAT
ATGGCGGCGG AGAAGAGTAA AAGGTTTCGG TACCGCTTTG ATAAAGACAT GGCTGAGAAA
GCTGCAAAGT TTATTCAGTT ACTTCCACAT ACAAAAGGTG AATGGGCGTT CAAACGTATG
CCGATTACCC TGGAACCGTG GCAACTTTTC ATCGTGTGCT GTGCCTTTGG CTGGGTACAG
AAGGGAACAA AGCTTCGTCG TTTTCGTGAG GTCTACACAG AGATACCACG TAAGAACGGG
AAATCGGCTA TTTCAGCTGG TGTAGCTCTC TACTGTTTCA CCTGTGATAA CGAATTCGGT
GCGGAAGTAT ACTCCGGCGC CACGACTGAA AAACAGGCGT GGGAGGTATT TCGTCCCGCG
CGTCTGATGT GTAAGCGCAC ACCACTACTG GTGGAGGCAT TCGGTATAGA GGTGAATGCC
TCAAACCTGA ACCGTCCGGA GGATGGTGCC CGCTTCGAGC CGTTGATCGG CAACCCCGGG
GACGGGGCAT CACCGCACTG CGCAATAGTT GACGAATACC ACGAACACCC TACGGATGCG
CTCTACACAA CAATGCTTAC AGGTATGGGC GCGCGCCGAC AGCCGCTGAT GTGGGCAATA
ACCACGGCGG GCTACAACAT CGAGGGGCCG TGTTACGACA AGCGACGCGA AGTGATTGAG
ATGCTGAACG GATCGGTGCC GAACAACGAA CTTTTTGGCG TGATTTACAC GGTTGATGAA
GGGGATGACT GGACAGATCC AAAAGTGCTG GAGAAAGCAA ACCCGAACAT TGGGGTGTCA
GTATACCGTG ACTTCCTTCT CAGTCAGCAA CAGCGTGCTA TTAACAATGC CCGCCATGCG
GGTGTGTTCA AAACGAAGCA TCTCAATGTA TGGGTTGCCG CCCGCACAGC ATTCTTTAAT
CTGGTTTCCT GGCAAAACTG TGAGGATAAG ACGCTGACGC TGGAACTGTT TGAGGGTCAA
CCCTGCGTAC TGGCGTTCGA TCTGGCTCGT AAGCTGGACA TGAACAGCAT GGCGAGGTTA
TTTACCCGTG AAATAGACGG GAAAACGCAT TTTTACAGCG TGGCGCCACG TTTCTGGGTG
CCGTATGACA CGGTCTACAG TGTTGAGAAA AATGAGGATC GCCGTACTGC GGAACGTTTT
CAGAAATGGG TTGAAATGGG CTTTTTGACA GTAACTGATG GTGCGGAGGT GGATTACCGC
TACATCCTTG AAGAGGCCAA AGCTGCGAAC AAACTGAACC CGGTCAGCGA ATCCCCCATT
GATCCATTTG GTGCCACCGG GCTTTCACAT GATCTAGCTG ATGAAAACCT GAATCCCGTC
ACTATCATCC AGAATTACAC CAACATGTCC GATCCGATGA AAGAACTGGA AGCGGCGATT
GAATCGGGTC GCTTTCATCA TGACGGCAAT CCCATCATGA CCTGGTGTAT CGGCAACGTG
GTCGGCAAAA CCATTCCGGG TAACGATGAC GTGGTGAAGC CTATTAAGGA GCAGGCGGAA
AATAAAATCG ATGGTGCAGT TGCACTGATT ATGGCGGTTG GCAGAGCCAT GCTGTACGAG
AAAGAAGACA CGCTGTCTGA CCACATTGAG TCCTACGGGA TCCGCTCGCT TTAA
 
Protein sequence
MSRKSYPNVN AANQYARNVV RGKITACQYV IQACQRHIDD MAAEKSKRFR YRFDKDMAEK 
AAKFIQLLPH TKGEWAFKRM PITLEPWQLF IVCCAFGWVQ KGTKLRRFRE VYTEIPRKNG
KSAISAGVAL YCFTCDNEFG AEVYSGATTE KQAWEVFRPA RLMCKRTPLL VEAFGIEVNA
SNLNRPEDGA RFEPLIGNPG DGASPHCAIV DEYHEHPTDA LYTTMLTGMG ARRQPLMWAI
TTAGYNIEGP CYDKRREVIE MLNGSVPNNE LFGVIYTVDE GDDWTDPKVL EKANPNIGVS
VYRDFLLSQQ QRAINNARHA GVFKTKHLNV WVAARTAFFN LVSWQNCEDK TLTLELFEGQ
PCVLAFDLAR KLDMNSMARL FTREIDGKTH FYSVAPRFWV PYDTVYSVEK NEDRRTAERF
QKWVEMGFLT VTDGAEVDYR YILEEAKAAN KLNPVSESPI DPFGATGLSH DLADENLNPV
TIIQNYTNMS DPMKELEAAI ESGRFHHDGN PIMTWCIGNV VGKTIPGNDD VVKPIKEQAE
NKIDGAVALI MAVGRAMLYE KEDTLSDHIE SYGIRSL
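
The gene and protein sequences above are printed in blocks of ten characters. The following is a minimal sketch of how the blocks could be joined, the GC fraction computed, and the DNA translated. It assumes the amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ only in the permitted start codons). All function names are hypothetical helpers written for this illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks between the ten-character blocks."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First thirty bases of the gene sequence shown above.
print(translate("ATGAGCCGGA AATCATATCC CAACGTTAAC"))  # MSRKSYPNVN
```

Passing the complete gene sequence to `translate` should reproduce the listed protein sequence, and `gc_content` on the same input can be compared with the GC content field.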