Gene SeD_A3035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3035 
Symbol 
ID6872500 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2932853 
End bp2933761 
Gene Length909 bp 
Protein Length302 aa 
Translation table11 
GC content59% 
IMG OID642786065 
Productbaseplate assembly protein J 
Protein accessionYP_002216711 
Protein GI198242076 
COG category[R] General function prediction only 
COG ID[COG3948] Phage-related baseplate assembly protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.805786 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.000161965 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCAGTCA TTGACCTTTC CCAGTTGCCT GCGCCGCAGA TAGTGGACGT GCCGGATTTT 
GAGACGCTGC TGGCTGAGCG CAAGGCCGCT TTTGTGCTCC TTTATCCGGC GGATGAACAG
GACGCGGTGC GGCGCACACT GGCGCTGGAA TCTGAACCCG TCACCAAGCT GCTGCAGGAA
AGTACATACC GCGAAATCCT GCTGCGCCAG CGTATTAACG AGGCTGCGCA GGCGGTCATG
GTGGCCTATT CGATAGGAAA TGATCTTGAG CAGCTGGCAG CCAACTGCAA CGTGAAACGT
CTGACGGTAG TGCCTGCTGA TAATGATGCA GTACCGCCGG TCGCCGCAGT GATGGAAGAT
GATGATGCGC TGCGCCAGCG CATCCCTGCA GCATTTGAGG GACTGTCCGT TGCTGGCCCG
ACGGGAGCCT ATGAATTTCA CGCCAGAAGT GCGGACGGAC GTGTGGCAGA TGCCAGCGCA
ACCAGTCCGG CCCCTGCAGA GGTGGTACTT ACCGTACTGA GCCGGGAGGG TGACGGTACA
GCAGTAAAAG ACCTGCTGGA TGTGGTTGAA AAAGCCCTGA ACAGTGAGAG TGTACGCCCG
GTGGCTGACC GTCTGACGGT TCGTAGTGCG GAGATCATAC CGTACCGGGT GGAGGCTACC
ATTTTTCTTT ATCCGGGGCC GGAAGCGGAG CCTGTTATGG CGGCGGCAAA AGCCAGCCTG
CAGAAGTACA TCGCCAGTCA GACGAGGCTG GGACGTGATA TCCGCCGCAG CGCCATTTAT
GCCGCGCTGC ACGTGGAGGG CGTCCAGCGT GTGGAGCTAA CGTCCCCTCT GGAGGATGTG
GTGCTGGATA AGACGCAGGC GGCATCCTGT ACTGAATGGA GCGTTACCAA CGGGGGCACG
GATGAATAG
 
Protein sequence
MAVIDLSQLP APQIVDVPDF ETLLAERKAA FVLLYPADEQ DAVRRTLALE SEPVTKLLQE 
STYREILLRQ RINEAAQAVM VAYSIGNDLE QLAANCNVKR LTVVPADNDA VPPVAAVMED
DDALRQRIPA AFEGLSVAGP TGAYEFHARS ADGRVADASA TSPAPAEVVL TVLSREGDGT
AVKDLLDVVE KALNSESVRP VADRLTVRSA EIIPYRVEAT IFLYPGPEAE PVMAAAKASL
QKYIASQTRL GRDIRRSAIY AALHVEGVQR VELTSPLEDV VLDKTQAASC TEWSVTNGGT
DE