Gene SeD_A0310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0310 
Symbol 
ID6873579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp329155 
End bp331344 
Gene Length2190 bp 
Protein Length729 aa 
Translation table11 
GC content57% 
IMG OID642783553 
ProductRhs family protein 
Protein accessionYP_002214241 
Protein GI198243551 
COG category[S] Function unknown 
COG ID[COG3501] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01646] Rhs element Vgr protein
[TIGR03361] type VI secretion system Vgr family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.261642 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value0.473286 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTTG TATCCACAAA TAATAAATCC GGTATGGGAG GGCTGACGAC AACCACGCCG 
CCGATAACCG GAGAAAGTGG CGGTGTCACC GCAGATTCAG TCGCCGGAAG CGTGGCAGAT
GCGGCGGAAT CCGCCGTGGA ACAGGCTGCG GGATCGCTGT TTGGCGCATT GCCGGAGCCA
TCAGGACTGG TGAAAGCCGC GGTAGCAGCG GCGCAGGCTG CCGCCGCCGC AGGTATGGCG
CAGGATGCGG TATCGGCCAT CGTCTCTGCT GTTGCAGACG GGCCGGGGGC GCATAATGTG
ACGGTCAGCG GCAGCGCCGT ACCGCCGGGC GCATTACTGT TCGCCAGCCT GGACGGCGGC
GAAACATTAA GTGAACTGTT CAGCTATGTG GTACAGCTAA AAACGCCCGA CACCCTGAAT
CTGGGCTATG TCTCCCCGGC GGCCAACCTG CCGCTCAAAC CGATGGTGGG CAAAGATCTG
TGCGTCAACA TCGAACTGGA TGGTGGCGGT AAACGACATA TCAGCGGGCT GGTCACGGCG
GCGCGGGTGG TGGGCCATGA AGGGCGTTCG GTTACCTATG AGCTGCGTAT GGAGCCGTGG
CTGAAACTGC TGACCCATAC CAGCGACTAC AAAGCATTCC AGAATAAAAC CGTGGTGGAT
ATTCTGGATG AGGTTCTGGC GGAATACCCC TACCCGGTGG AAAAGCGGCT GGTGGAAAGC
TACCCGGTAC GCACCTGGCA GGTGCAGTAC GGTGAAACTG ATTTTGATTT TCTTCAGCGA
CTGATGCAGG AGTGGGGCAT CTACTGGTGG TTTGAGCACA GCGAGGACAG CCACACGCTG
GTGCTGGCGG ATGCCATCAG CGCCCACAAA GCATGTCCGG ACTCGCCGCT GGTCGAGTGG
CACCAGGAAG GGCTGAAGCT GGACAAGGAG TTTATCCACA CTATCACGGC AAACGAGAGT
CTGCGGACTG GCCAGTGGGT GCTGGATGAT TTCGATTTTA CGAAGCCACG TTCATTGCTG
GCAAACACGG TGGCAAACCC GCGTGAAACC GGTCATGCCA CCTACGAGCA TTATGAGTGG
CCGGGAGACT ACTTCGACAA GAGTGAAGGC GAGATGCTGA CGCGCATTCG TATGGAAGCG
CAGCGCAGCC CCGGCAGCCG GGTGCTGGGT GCCGGGAATA TTCGTACGCT GATGACAGGC
TATACCTTCA CGCTGGAAAA CTATCCCACC GCCGAAGTCA ATCAGGAATA TCTGCTGATG
CAGACCTTGC TGTTTGTGCA GGACAACGCG CAGCACAGCG GGCAGGACCA GCACTTTACC
TTTTCCACCC GTTTTGAACT GCACCCCACC CGCGAGGTGT TCCGCCCGCA GCGGACGGTG
AGCAAACCCC ACACCAAAGG GCCGCAGAGC GCCATTGTCA CCGGCCCGTC GGGCCAGGAA
ATCTGGACGG ATCAGTACGG GCGGGTAAAG GTACAGTTTG GCTGGGATCG CTACGGCAAA
ATGGATGAAA ACAGCTCCTG CTGGATACGC GTCAGCTACC CATGGGCGGG CAAAGGCTTC
GGGATGATCC AGATCCCGCG TATCGGCCAG GAAGTGCTGG TGGATTTCAA AAACGGCGAT
CCGGATCTGC CGATCATCGT GGGGCGTACC TACAACCAGG ACACCATGCC GCCGTGGGGA
CTGCCGGGAA TGGCGTCGCA GAGCGGGATC TTCAGCCACT CGCTGTATGG CGGGCCAACG
AACGGCAACA TGCTGCGTTT TGACGACAAA ACGGGCGCGG AGGAAGTGAA GTTCCACGCG
GAAAAAGATC TCAACACCAC GGTGAAGAAT AATGAAACGC ATACGGTTAT GGTGGATCGC
ACTAAAACCA TTATTAAAAA TGAAACCAAC AGTATTGGTG AGGACAGAAA CACCACGGTA
ACGAAGAATG ACGGCCTTTC CGTAAAACTG GCGCAGACGA TCAATATCGG CACCACTTAT
CGTTTAGATG TTGGCGATCA ATTCACACTT CGCTGCGGCA ATGCGGCGCT TGTTTTACAT
AAGGACGGCT CCATTGAGTT TTGTGGCAAG CAACTGATGT TACATACCAG CGATGTCATG
CAACTGATTG GTAAAGGTAT TGATATGAAC CCGGATGGCG GCACAGCCGT AACCGCCGAT
GATATTGCCC CCCTTCCCAC CTCTGAGTGA
 
Protein sequence
MSFVSTNNKS GMGGLTTTTP PITGESGGVT ADSVAGSVAD AAESAVEQAA GSLFGALPEP 
SGLVKAAVAA AQAAAAAGMA QDAVSAIVSA VADGPGAHNV TVSGSAVPPG ALLFASLDGG
ETLSELFSYV VQLKTPDTLN LGYVSPAANL PLKPMVGKDL CVNIELDGGG KRHISGLVTA
ARVVGHEGRS VTYELRMEPW LKLLTHTSDY KAFQNKTVVD ILDEVLAEYP YPVEKRLVES
YPVRTWQVQY GETDFDFLQR LMQEWGIYWW FEHSEDSHTL VLADAISAHK ACPDSPLVEW
HQEGLKLDKE FIHTITANES LRTGQWVLDD FDFTKPRSLL ANTVANPRET GHATYEHYEW
PGDYFDKSEG EMLTRIRMEA QRSPGSRVLG AGNIRTLMTG YTFTLENYPT AEVNQEYLLM
QTLLFVQDNA QHSGQDQHFT FSTRFELHPT REVFRPQRTV SKPHTKGPQS AIVTGPSGQE
IWTDQYGRVK VQFGWDRYGK MDENSSCWIR VSYPWAGKGF GMIQIPRIGQ EVLVDFKNGD
PDLPIIVGRT YNQDTMPPWG LPGMASQSGI FSHSLYGGPT NGNMLRFDDK TGAEEVKFHA
EKDLNTTVKN NETHTVMVDR TKTIIKNETN SIGEDRNTTV TKNDGLSVKL AQTINIGTTY
RLDVGDQFTL RCGNAALVLH KDGSIEFCGK QLMLHTSDVM QLIGKGIDMN PDGGTAVTAD
DIAPLPTSE