Gene SeD_A1051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1051 
Symbol 
ID6874712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1053174 
End bp1054406 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content55% 
IMG OID642784236 
Producthypothetical protein 
Protein accessionYP_002214910 
Protein GI198245764 
COG category[S] Function unknown 
COG ID[COG3214] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value0.241147 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATTGC CGTACCTTTC TCTTTCCCAG GCCCGTTGTC TTCACCTTGC TGCGCAGGGG 
CTATTGAAAA AGCCGCGCCG TAACGCGCTG CCTGGCGATG TTCTTGCCGC CATCTCACGC
ATGGCGTTGC TGCAAATTGA TACCATCAAT GTTGTCGCAC GTAGCCCCTA TCTGGTGCTG
TTTAGCCGTC TCGGTTCGTA CCCGCAGGCC TGGCTGGATG AGGCGCTGCG ACGCGGCGAG
TTAATGGAAT ACTGGGCGCA TGAGGCCTGT TTCTTACCAC GCCGCGACTT TAAACTTATC
CGCCATCGTA TGCTGTCGCC GGAAAAGATG GGCTGGAAAT ATCGCGCGGC ATGGATGCAT
GAGCACGCGG AAGAAATAGA ACAGCTAATG CGGCATATTC AGGAGCACGG CCCGGTGCGA
TCTGCCGATT TTGAACATGC GCAGAAAGGC GCCAGCGGCT GGTGGGAATG GAAACCACAT
AAACGCCACC TTGAGGGTTT ATTTACCGCC GGAAAAGTCA TGGTTGTTGA GCGGCGTAAT
TTTCAACGTG TATATGATTT AACGCGCCGT GTGATGCCGC ACTGGGATGA TGAACGCGAT
GGACTGTCAC AGCCGCAGGC GGAAAGCCTG ATGCTGGATA ATAGCGCGCG CAGTCTGGGG
ATTTTCCGTG AACAGTGGCT GGCGGATTAC TACCGCCTGA AACGTCCTGA CCTGAAAGGA
TGGCGGGAGA GCCGGGCGGA ACAGCAGCAG ATTATTCCGG TCGAGGTGGA AACGTTGGGG
CGGATGTGGC TTCATGCCGA TCTTCTTTCG CAGCTTGAAC CGGCGCTAAA TAACGCCTTA
AAGGCGACCC ATAGCGCAGT GCTGTCGCCT TTCGATCCTG TGATATGGGA TCGCAAGCGG
GCAGCGCAGC TTTTCGGATT TAACTATCGG CTGGAATGTT ATACGCCTGC GGCGAAGCGC
CAGTACGGTT ATTTTGTGCT GCCGCTATTA TACCAGGGCC GTTTAGTCGG GCGAATGGAC
GCCAAAATGC ACCGTAAAAC GGGGGTACTT GAGGTTATCT CGCTGTATCT GGAGGACGAT
ATTCGCCCTG GCGTTAGTCT GCAAAAAGGA ATCTGGCAGG CCATTAGCGC GTTTGCTGCC
TGGCAACGGG CATCGCGCGT GACGCTGGGA CAATGTCCGC CAGGCCTGTT TAGCGCCATG
CGTCATGGCT GGGAAATAGA CCCTGCACCA TAA
 
Protein sequence
MSLPYLSLSQ ARCLHLAAQG LLKKPRRNAL PGDVLAAISR MALLQIDTIN VVARSPYLVL 
FSRLGSYPQA WLDEALRRGE LMEYWAHEAC FLPRRDFKLI RHRMLSPEKM GWKYRAAWMH
EHAEEIEQLM RHIQEHGPVR SADFEHAQKG ASGWWEWKPH KRHLEGLFTA GKVMVVERRN
FQRVYDLTRR VMPHWDDERD GLSQPQAESL MLDNSARSLG IFREQWLADY YRLKRPDLKG
WRESRAEQQQ IIPVEVETLG RMWLHADLLS QLEPALNNAL KATHSAVLSP FDPVIWDRKR
AAQLFGFNYR LECYTPAAKR QYGYFVLPLL YQGRLVGRMD AKMHRKTGVL EVISLYLEDD
IRPGVSLQKG IWQAISAFAA WQRASRVTLG QCPPGLFSAM RHGWEIDPAP