Gene SeD_A1648 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1648 
Symbol 
ID6872066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1589675 
End bp1590736 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content57% 
IMG OID642784792 
Producthypothetical protein 
Protein accessionYP_002215460 
Protein GI198242282 
COG category[S] Function unknown 
COG ID[COG3768] Predicted membrane protein 
TIGRFAM ID[TIGR01620] conserved hypothetical protein, TIGR01620 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.000331531 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCGAAC CGTTAAAACC GCGTATTGAT TTTGCAGAAC CGCTAAAGGA GGAACCTACG 
TCGGCCTTCA AAGCGCAGCA AACTTTTAGC GAAGCGGAGT CGCGTACATT TGCGCCTGCA
GCTATCGATG AGCGCCCGGA AGACGAAGGC GTGGCAGAAG CGGCGGTCGA TGCCGCGCTG
CGCCCCAAAC GCAGTCTGTG GCGTAAAATG GTGATGGGAG GGCTGGCGCT GTTTGGCGCG
AGCGTGGTCG GGCAAGGCGT ACAGTGGACA ATGAATGCCT GGCAAACTCA GGACTGGGTC
GCTTTAGGCG GCTGTGCCGC AGGCGCGCTG ATCATTGGCG CTGGCGTGGG ATCGGTGGTC
ACGGAGTGGC GGCGATTATG GCGCTTGCGC CAGCGGGCGC ATGAGCGCGA TGAGGCGCGT
GAACTGTTAC ATAGCCATAG CGTCGGGAAA GGTCGCGCAT TTTGCGAAAA ACTGGCGCAG
CAGGCGGGGA TTGATCAATC ACATCCGGCA TTACAACGTT GGTATGCCGC TATTCACGAA
ACGCAAAACG ACAGGGAAAT CGTCGGTTTG TATGCGCATC TGGTACAGCC GGTACTTGAC
GCGCAGGCGC GACGTGAGAT TAGCCGTTTC GCCGCGGAAT CGACTCTGAT GATCGCCGTC
AGCCCGTTAG CGTTGGTGGA TATGGCGTTT ATTGCCTGGC GTAATTTACG CCTGATTAAC
CGCATCACAA CGCTGTATGG CATTGAACTT GGTTATTACA GCCGCCTTCG TCTGTTCCGT
CTGGTGTTGC TGAATATCGC GTTTGCGGGA GCCAGTGAGC TGGTGCGTGA AGTCGGTATG
GACTGGATGT CTCAGGATCT GGCCGCACGC TTGTCCACGC GTGCGGCGCA GGGGATTGGC
GCAGGTCTCC TTACCGCTCG ACTGGGAATA AAAGCGATGG AGCTATGTCG ACCATTGCCG
TGGATCGATA ACGATAAACC ACGTCTCGGT GATTTTCGTC GTCAGCTTAT CGGTCAGCTA
AAAGAGACGC TGCAAAAGAG TAAGTCGTCG CCTGAGAAAT GA
 
Protein sequence
MSEPLKPRID FAEPLKEEPT SAFKAQQTFS EAESRTFAPA AIDERPEDEG VAEAAVDAAL 
RPKRSLWRKM VMGGLALFGA SVVGQGVQWT MNAWQTQDWV ALGGCAAGAL IIGAGVGSVV
TEWRRLWRLR QRAHERDEAR ELLHSHSVGK GRAFCEKLAQ QAGIDQSHPA LQRWYAAIHE
TQNDREIVGL YAHLVQPVLD AQARREISRF AAESTLMIAV SPLALVDMAF IAWRNLRLIN
RITTLYGIEL GYYSRLRLFR LVLLNIAFAG ASELVREVGM DWMSQDLAAR LSTRAAQGIG
AGLLTARLGI KAMELCRPLP WIDNDKPRLG DFRRQLIGQL KETLQKSKSS PEK