Gene SeD_A2574 details


Gene Information

Locus tag: SeD_A2574
Symbol:
ID: 6871098
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 2455122
End bp: 2456882
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 57%
IMG OID: 642785649
Product: putative helicase
Protein accession: YP_002216307
Protein GI: 198244449
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1061] DNA or RNA helicases of superfamily II
TIGRFAM ID:
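
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2455122
end_bp = 2456882

# Assumption: coordinates are 1-based and inclusive,
# so both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is
# not part of the translated protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1761 586
```

Under those assumptions the result agrees with the listed Gene Length (1761 bp) and Protein Length (586 aa).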


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.452066
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 0.00320977
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATTTTTA CACTCCGCCC CTACCAGCAA GAAGCCGTAG ACGCCACGCT CAGCCACTTT 
CGCCGCCACC GTACGCCCGC CGTGATTGTT CTGCCGACCG GCGCAGGTAA AAGCCTGGTG
ATCGCCGAAC TGGCGCGCGT CGCCCGCGGA CGGGTACTGG TGCTGGCGCA TGTGAAAGAG
CTGGTCGCGC AGAACCACGC CAAATATTGC GCGCTGGGGC TGGAAGCGGA TATTTTCGCC
GCCGGACTCA AACGTAAAGA GAGTCAGGGC AAAGTCGTGT TCGGCAGCGT ACAGTCGGTG
GCGCGTAATC TTGACGCCTT CCAGGAGGAG TTTTCGCTGT TGATTGTCGA TGAATGCCAC
CGCATCGGTG ACGATGAAGA CAGTCAGTAT CAGCAAATCC TCACTCACCT GAGTAAAGTT
AATCCTCACT TACGTCTGCT TGGACTCACC GCCACGCCTT TTCGCCTCGG AAAAGGCTGG
ATTTATCAAT TTCATTATCA CGGTATGGTG CGCGGCAACG ACAACGCTCT GTTTCGCGAC
TGTATTTATG AACTGCCGCT GCGCTATATG ATTAAACACG GCTATCTGAC GCCGCCTGAG
CGGCTTGATA TGCCAGTGGT CCAATACGAT TTCAGCCGCC TGCAGGCCCA AAGCAATGGG
CTGTTCAGCG AAGCCGACCT GAACCGCGAG CTGAAAAAGC AGCAGCGGAT TACGCCGCAC
ATCATCAGCC AGATTATGGA ATTTGCGCAA ACGCGCAAAG GCGTGATGAT TTTCGCCGCC
ACGGTCGAAC ATGCGAAAGA GATTGTCGGT CTGCTTCCGG CGGACGACGC GGCGCTGATT
ACCGGCGATA CGCCAGGGCC CGAGCGCGAC GCGCTGATTG ATAATTTCAA GGCGCAGCGT
TTTCGCTATC TGGTTAACGT CTCGGTGCTG ACCACCGGCT TTGACGCCCC ACACGTTGAT
CTCATCGCGA TTCTACGTCC CACGGAGTCA GTTAGTCTTT ACCAACAAAT TGTCGGGCGT
GGTCTGCGCC TTGCGCCGGG AAAGACCGAT TGCCTGATTC TTGATTACGC AGGCAACCCG
CACGACCTGT ATGCCCCGGA GGTCGGTAGC CCGAAGGGAA AAAGCGATAA CGTCCCTGTC
CAGGTATTTT GCCCGGCCTG CGGCTTTGCC AACACCTTCT GGGGGAAAAC CACTGCCGAC
GGCACGCTGA TTGAACACTT TGGCCGTCGC TGCCAGGGCT GGTTTGAGGA TGACGACGGC
CATCGCGAGC AGTGCGATTT TCGCTTTCGC TTCAAAAACT GCCCGCAGTG TAATGCCGAA
AACGATATTG CTGCCCGACG CTGCCGGGAA TGTGACGCCA TTCTGGTCGA CCCGGACGAT
ATGTTAAAAG CGGCGCTCAG GCTCAAGGAT GCGTTAGTCC TGCGCTGTAG CGGAATGACG
ATGCAGCATG GGCAGGATGA GAAAGGCGAA TGGCTGAAAA TCACTTACTA TGACGAGGAC
GGCGCGGATG TCAGTGAGCG CTTCCGCTTG CACACGCCCG CCCAGCGTAC CGCTTTCGAA
CAGCTATTTA TTCGCCCGCA TACGCGCACG CCTGGCGTTC CTTTACGCTG GATCACGGCG
GCGGATATTG TCGCGCAGCA GGCGCTGTTG CGGCATCCCG ATTTTGTGGT CGCGCGGATG
AAAGGCCAGT ACTGGCAGGT GCGTGAAAAA GTGTTCGACT ATGAAGGCCG CTTCCGCCGG
GCGCACGAAT TACGTGGTTA A
 
Protein sequence
MIFTLRPYQQ EAVDATLSHF RRHRTPAVIV LPTGAGKSLV IAELARVARG RVLVLAHVKE 
LVAQNHAKYC ALGLEADIFA AGLKRKESQG KVVFGSVQSV ARNLDAFQEE FSLLIVDECH
RIGDDEDSQY QQILTHLSKV NPHLRLLGLT ATPFRLGKGW IYQFHYHGMV RGNDNALFRD
CIYELPLRYM IKHGYLTPPE RLDMPVVQYD FSRLQAQSNG LFSEADLNRE LKKQQRITPH
IISQIMEFAQ TRKGVMIFAA TVEHAKEIVG LLPADDAALI TGDTPGPERD ALIDNFKAQR
FRYLVNVSVL TTGFDAPHVD LIAILRPTES VSLYQQIVGR GLRLAPGKTD CLILDYAGNP
HDLYAPEVGS PKGKSDNVPV QVFCPACGFA NTFWGKTTAD GTLIEHFGRR CQGWFEDDDG
HREQCDFRFR FKNCPQCNAE NDIAARRCRE CDAILVDPDD MLKAALRLKD ALVLRCSGMT
MQHGQDEKGE WLKITYYDED GADVSERFRL HTPAQRTAFE QLFIRPHTRT PGVPLRWITA
ADIVAQQALL RHPDFVVARM KGQYWQVREK VFDYEGRFRR AHELRG
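
The relationship between the gene and protein sequences can be illustrated with a minimal translation sketch. It is not the tool used to produce this record. It builds a codon lookup from the standard compact amino-acid string; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in permitted start codons, which this sketch does not model. The `translate` helper and the variable names are hypothetical.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the
# standard amino-acid string ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    # Drop the spaces and line breaks used in the 10-base display blocks.
    clean = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(clean) - 2, 3):
        aa = CODON_TABLE[clean[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 21 bases of the gene sequence shown above.
gene_start = "ATGATTTTTA CACTCCGCCC CTACCAGCAA"
gene_end = "GCGCACGAAT TACGTGGTTA A"

print(translate(gene_start))  # MIFTLRPYQQ
print(translate(gene_end))    # AHELRG
```

Both fragments reproduce the corresponding ends of the protein sequence listed above. Passing the full gene sequence to `translate` in the same way would allow the whole protein to be compared.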