Gene SeD_A2072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2072 
Symbol 
ID6875085 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2005282 
End bp2006568 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content51% 
IMG OID642785185 
Producthypothetical protein 
Protein accessionYP_002215851 
Protein GI198246165 
COG category[S] Function unknown 
COG ID[COG2718] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.0000406999 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCTGGT TCATAGACCG ACGTCTTAAC GGCAAAAATA AAAGCACGGT GAATCGCCAG 
CGCTTTTTGC GCCGTTATAA AGCACAAATT AAGCAGTCAA TTTCCGAAGC GATTAATAAA
CGCTCTGTGA CCGATGTCGA CAGCGGAGAG TCCGTCTCTA TTCCAACCGA TGATATTAGC
GAACCGATGT TTCATCAGGG GCGCGGCGGT CTGCGCCATC GCGTCCATCC GGGTAACGAT
CACTTTATCC AGAATGATCG CATTGAGCGT CCGCAAGGCG GTGGCGGCGG CGGTTCCGGC
AGCGGTCAAG GTCAGGCCAG CCAGGACGGC GAAGGCCAGG ATGAGTTTGT TTTTCAGATT
TCAAAAGATG AATATCTGGA TCTGCTCTTT GAAGATTTAG CGCTGCCTAA TCTGAAGAAA
AACCAGCATC GCCAGCTTAA CGAGTATAAA ACTCACCGCG CCGGTTTCAC CTCAAACGGC
GTACCGGCCA ATATCAGCGT GGTACGTTCG CTACAAAACT CTCTGGCGCG CCGTACAGCA
ATGACGGCAG GAAAACGCCG CGAACTGCAC GCGCTGGAAA CGGAACTGGA GACCATCAGC
CATAGCGAAC CAGCGCAACT GCTTGAAGAG GAGCGGTTAC GTCGGGAAAT TGCCGAACTA
CGGGCTAAAA TCGAGCGAGT GCCGTTTATC GACACCTTTG ATTTACGCTA TAAAAATTAT
GAAAAACGGC CTGAGCCCTC CAGCCAGGCG GTGATGTTCT GTCTGATGGA CGTCTCGGGT
TCGATGGACC AGGCAACCAA AGATATGGCC AAGCGTTTTT ACATTCTGCT CTATCTGTTT
TTGAGCCGAA CATATAAGAA CGTAGAAGTG GTTTATATCC GCCACCATAC CCAGGCGAAG
GAAGTGGACG AACATGAGTT CTTTTATTCG CAAGAGACCG GGGGGACGAT TGTCTCCAGC
GCGCTTAAAC TCATGGATGA AGTGGTTAAA GAGCGCTACG ACCCGGGGCA GTGGAACATC
TATGCGGCGC AAGCGTCAGA CGGTGATAAC TGGGCCGACG ATTCACCGCT GTGTCATGAG
ATTCTGGCGA AAAAGCTGCT GCCGGTAGTG CGCTATTACA GCTATATCGA GATTACCCGC
CGCGCCCACC AGACCTTATG GCGCGAGTAT GAACATCTGC AGGCGACGTT CGATAACTTC
GCCATGCAGC ATATTCGCGA TCAGGAGGAT ATTTATCCGG TATTCCGCGA ATTGTTTCAG
AAACAGAGCG CCAATCAAAG CGTATAA
 
Protein sequence
MTWFIDRRLN GKNKSTVNRQ RFLRRYKAQI KQSISEAINK RSVTDVDSGE SVSIPTDDIS 
EPMFHQGRGG LRHRVHPGND HFIQNDRIER PQGGGGGGSG SGQGQASQDG EGQDEFVFQI
SKDEYLDLLF EDLALPNLKK NQHRQLNEYK THRAGFTSNG VPANISVVRS LQNSLARRTA
MTAGKRRELH ALETELETIS HSEPAQLLEE ERLRREIAEL RAKIERVPFI DTFDLRYKNY
EKRPEPSSQA VMFCLMDVSG SMDQATKDMA KRFYILLYLF LSRTYKNVEV VYIRHHTQAK
EVDEHEFFYS QETGGTIVSS ALKLMDEVVK ERYDPGQWNI YAAQASDGDN WADDSPLCHE
ILAKKLLPVV RYYSYIEITR RAHQTLWREY EHLQATFDNF AMQHIRDQED IYPVFRELFQ
KQSANQSV