Gene SeD_A4064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4064 
Symbol 
ID6871092 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3906148 
End bp3908103 
Gene Length1956 bp 
Protein Length651 aa 
Translation table11 
GC content57% 
IMG OID642787013 
Producthypothetical protein 
Protein accessionYP_002217640 
Protein GI198242542 
COG category[S] Function unknown 
COG ID[COG3533] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGTAC TGGAAGTCGA TCTGCATAAA CTGACGGTCA GCGATCCGTT CCTCGGACAG 
TATCAACAAC TGGTTCGCGA TGTGGTTATT CCTTACCAGT GGGATGCGTT AAACGATCGT
ATTCCAGAGG CTGAACCCAG CCATGCCATT GAAAATTTCC GCATTGCCGC AGGCCAGCAG
ACGGGCGACT TTTACGGCAT GGTCTTTCAG GACAGCGACG TGGCGAAATG GCTGGAAGCG
GTTGCCTGGT CACTGTGCCA GAAGCCCGAT CCAGCGCTTG AGAAAACCGC CGATGAGGTG
ATTGAACTGG TGGCCGCCGC GCAGTGTGAC GATGGCTATC TCAATACGTA CTTTACGGCA
AAAGCCCCGC AAGAACGCTG GAGCAACCTG GCGGAGTGCC ACGAGCTTTA TTGCGCCGGG
CACCTGATTG AAGCGGGCGT CGCCTTCTTT CAGGCCACTG GCAAGCGTCG GCTGCTAGAC
GTCGTTTGTC GCCTGGCCGA TCATATCGAC AGCACTTTCG GCCCTGGCGA AAATCAGCTG
CACGGCTATC CGGGCCACCC GGAAATTGAG CTGGCGCTGA TGCGTCTGTA TGAGGTAACA
GAGCAGCCGC GCTATATGGC GCTGGCAAGC TACTTTATCG GGCAGCGCGG CGTCCAACCG
CACTTCTACG ACGAAGAGTA CGAAAAACGC GGCCAGACCT CGTACTGGCA TACCTACGGC
CCGGCGTGGA TGGTCAAAGA CAAAGCCTAC AGCCAGGCGC ATCTGCCAAT TTCGCAGCAG
CAGACGGCCA TTGGCCACGC GGTACGTTTT GTCTATCTGA TGACTGGCGT GGCGCATCTC
GCTCGCCTGA GCAACGATGA AGGCAAACGC CAGGACTGCC TGCGCCTGTG GAAAAATATG
GCGCAGCGTC AGCTGTATAT CACCGGCGGG ATTGGCTCGC AGAGCAGCGG CGAAGCCTTT
AGCAGCGATT ACGATTTACC GAATGATTCG GTCTATGCGG AAAGCTGCGC TTCAATCGGC
CTGATGATGT TCGCCCGCCG GATGCTGGAA ATGGAAGCCG ATAGCCAGTA CGCCGACGTG
ATGGAGCGCG CGCTGTACAA CACCGTCCTC GGCGGTATGG CGCTGGATGG CAAGCATTTC
TTCTACGTCA ACCCGCTGGA AGTGCATCCA AAATCGTTAA AATTCAACCA TATTTACGAT
CACGTTAAGC CCATCCGCCA GCGCTGGTTT GGCTGCGCCT GCTGCCCGCC GAATATCGCC
CGCGTGCTCA CCTCCCTTGG TCACTACATC TACACGCCGC GTGCGGATGC GCTGTACATC
AATATGTACG TGGGTAACAG CATGGAAATA CCGGTTGGAA ATGGCGCGCT CAAACTGCGG
ATTGGCGGGA ACTACCCGTG GCAAGAGCAG GTGAAGATCG CCATCGACTC TGTGCAGCCG
GTACGTCACA CGCTGGCGCT ACGTCTGCCG GACTGGTGCC CTGAGGCAAA AGTGACGCTC
AACGGGCTGG AAGTGGAGCA GGATATTCGC AAAGGTTATC TGCATATCCG TCGGACCTGG
CAGGAGGGCG ATACGATAAC CCTGACGCTG CCGATGCCGG TTCGCCGCGT GTATGGCAAT
CCGCTGGCGC GTCACGTCGC CGGTAAGGTC GCCATTCAGC GCGGGCCGCT GGTCTATTGC
CTTGAGCAGG CCGATAACGG CGAAGAACTG CATAATCTGT GGTTACCGAA AGAGAGTGAG
TTCCGGGTCT TTGAGGGCAA AGGGCTTTTT GCGCATAAGA TGCTGATTCA GGCTGAAGGC
GAGAAGCAAA GCGCCCTAGA TGCGCAGCAT CAGGCGTTGT GGCACTACGA TAACGCGCCG
TCATCGCGCC AGCCGCAGAC GCTAACGTTC ATTCCGTGGT TTAGCTGGGC CAACCGTGGC
GAGGGCGAAA TGCGGATTTG GGTTAACGAG CGGTAA
 
Protein sequence
MNVLEVDLHK LTVSDPFLGQ YQQLVRDVVI PYQWDALNDR IPEAEPSHAI ENFRIAAGQQ 
TGDFYGMVFQ DSDVAKWLEA VAWSLCQKPD PALEKTADEV IELVAAAQCD DGYLNTYFTA
KAPQERWSNL AECHELYCAG HLIEAGVAFF QATGKRRLLD VVCRLADHID STFGPGENQL
HGYPGHPEIE LALMRLYEVT EQPRYMALAS YFIGQRGVQP HFYDEEYEKR GQTSYWHTYG
PAWMVKDKAY SQAHLPISQQ QTAIGHAVRF VYLMTGVAHL ARLSNDEGKR QDCLRLWKNM
AQRQLYITGG IGSQSSGEAF SSDYDLPNDS VYAESCASIG LMMFARRMLE MEADSQYADV
MERALYNTVL GGMALDGKHF FYVNPLEVHP KSLKFNHIYD HVKPIRQRWF GCACCPPNIA
RVLTSLGHYI YTPRADALYI NMYVGNSMEI PVGNGALKLR IGGNYPWQEQ VKIAIDSVQP
VRHTLALRLP DWCPEAKVTL NGLEVEQDIR KGYLHIRRTW QEGDTITLTL PMPVRRVYGN
PLARHVAGKV AIQRGPLVYC LEQADNGEEL HNLWLPKESE FRVFEGKGLF AHKMLIQAEG
EKQSALDAQH QALWHYDNAP SSRQPQTLTF IPWFSWANRG EGEMRIWVNE R