Gene SeD_A4939 details

Gene Information

Locus tag: SeD_A4939
Symbol:
ID: 6874898
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 4772922
End bp: 4774763
Gene Length: 1842 bp
Protein Length: 613 aa
Translation table: 11
GC content: 43%
IMG OID: 642787808
Product: DNA helicase II
Protein accession: YP_002218401
Protein GI: 198244016
COG category: [R] General function prediction only
COG ID: [COG3972] Superfamily I DNA and RNA helicases
TIGRFAM ID:
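
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive and that the gene length includes one stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4772922
END_BP = 4774763
GENE_LENGTH_BP = 1842
PROTEIN_LENGTH_AA = 613


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons print True for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length_from_cds(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 4774763 - 4772922 + 1 = 1842 bp, and 1842 / 3 - 1 = 613 aa, which agrees with the listed gene and protein lengths.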


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.891198
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 62
Fosmid unclonability p-value: 0.619317
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAATCA TTATTCCCAC CGTAAGTAGC TGTAGCGAGA AAATTACAGC AGGCGAAAAA 
AGGCTGGCAA GGCTACTGGA GAGAGGGTTA AGCGATAGCT GTACCTGCTG GTACGACACG
CCGATGGGCA AACAACATAG TCATCCTGAC TTTGTTATTC TGACCCCCGA CAAAGGTATA
TTATTCATTG AAGTTAAGGA CTGGTTTATT ACAAAAATCA AAGGTGCTAA TAAAACATAT
GTCCAGTATG AAACTAAAAA TGGTATAGAA AGTTTAAAAA ATCCTATTGA GCAAGTTCGT
CAGTATGCTT TCCAGACCAT TAATAGCCTC AAAACAGATC CACAACTTCG CCAGCAAGAA
GGCCAATATC AGGGCAGTTT TGTTATGCCG TATGGATACG GCGTGTATCT GTCCAATATT
TCTCGCTCTC AGCTTGAGAA AGCCTTTTCA CCTACCGAAC TTGCAGACAT ACTCCCTACC
GATAAGGTGA TATGTAAGGA TGAACTCAAT GAATTTATGT CTCAGGAAAA AGTAGCTGCC
AGGCTCGCAT TACTTGCCAA ATACAACTTT GCTCACCAGA CTACACCTCA GCAGCTGGAT
CGTATAAGAT GGCATCTTTA TCCTGATATT CGAATCCATA AACCCATTAA GCAAGCTGAT
GTAAAGAATT TCACACTCCA CACACCAAGT ATTATATCCA TTATGGACCG GCAACAGGAA
CAACTTGCAA GAAGCATGGG GTCCGGACAT CGGGTGATTC ATGGCGTTGC AGGATCTGGT
AAATCACTGA TATTACTTCA CCGTTGCCTC GAACTTGCAA ATAATATTGA AAACGAAAAA
CCTGTTTTGG TCATTTGCTA CAACATTACG CTCGCCAGAA AACTCAAAGC CCTTATTGAG
AAACATACGC TCAGGCTTCC TGTTGAGGTC ACACATTTTC ATTTATGGTG TCATCAACAA
TTACAGTCTT ATGGACGGCT ACCGCCAAAA AGCAAAAACT TTGTTGAGCT AATGGAAAAT
GCCCTTTCTA TCGCCTTTGA GGATGGAATC ATCCAGCCAG AGCAGTACAG CGCCGTGCTG
ATCGACGAAG GCCACGATTT TAATCCTGAG TGGCTAAGAA TTCTTACCCG AATGGCAGAC
ACCAAAAATA ACACACTGCT TTTCCTTTAT GATGATGCAC AATCAATTTA TCAGAAAAAG
AAAGCGCTCG ACTTCACCTT ATCCAGCGTC GGCATTAAAG CTCAGGGGCG AACGACAATC
CTGAATATTA ATTACCGTAA TACTCAGCAA ATATTACATT TCGCTAGTAG CATTGCATTT
AACTATCTCA ATAATCACAT TGAGGATGCT CTTAAATATC AGCAGCCCGA CTCAGGCGGA
ATTGCGGGTT CTTATCCTGA GTTGACCTGT TTTGATAACC AGGATGAGGA AATAACGCAT
ATCCTTAACT GGGTTATCGA ACAGCACCAG CGGGGCATTC CGTGGTTTGA TATTGCCATA
TTGTGCCCAT CAACCTATAG CATTAAAGAC GTGCTGGGAC CACAACTGAC GGCTCGCAAT
ATCCCTTATC AAATGATTAT TACCTCTGCC GATAAAAAAA ACTGGTCGCC ACAGCAAGAA
CGGCTGTGTG TTATGCCTCT GCCGAGCAGT AAAGGACTCG AATTTCAGTC AGTCGCTGTC
ATGGACGCAG CCAGATCCAA AAATGAAGAA GACTTGAGCA ATGATATCAA AAGGCTTTAC
GTTGGCTTCA CACGGGCTCG CTATAATCTG CTGATAACTA TGAATGGAGA AAATGCACTC
AGCGAGCATC TTCTCACTAC CTATAAGCAG ATAACACGAT AG
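
The GC content field can be recomputed from a sequence block in the layout shown above (groups of ten bases separated by spaces). This is a minimal sketch, not part of the original record. The helper name `gc_percent` is hypothetical, and only the first row is used as a demonstration, so its result is not expected to equal the whole-gene figure.

```python
def gc_percent(sequence_text: str) -> float:
    """Percentage of G and C bases in a sequence, ignoring spaces and line breaks."""
    bases = "".join(sequence_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # First row of the gene sequence above; paste the full block to check the
    # listed whole-gene GC content.
    first_row = "ATGGCAATCA TTATTCCCAC CGTAAGTAGC TGTAGCGAGA AAATTACAGC AGGCGAAAAA"
    print(f"{gc_percent(first_row):.1f}%")
```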
 
Protein sequence
MAIIIPTVSS CSEKITAGEK RLARLLERGL SDSCTCWYDT PMGKQHSHPD FVILTPDKGI 
LFIEVKDWFI TKIKGANKTY VQYETKNGIE SLKNPIEQVR QYAFQTINSL KTDPQLRQQE
GQYQGSFVMP YGYGVYLSNI SRSQLEKAFS PTELADILPT DKVICKDELN EFMSQEKVAA
RLALLAKYNF AHQTTPQQLD RIRWHLYPDI RIHKPIKQAD VKNFTLHTPS IISIMDRQQE
QLARSMGSGH RVIHGVAGSG KSLILLHRCL ELANNIENEK PVLVICYNIT LARKLKALIE
KHTLRLPVEV THFHLWCHQQ LQSYGRLPPK SKNFVELMEN ALSIAFEDGI IQPEQYSAVL
IDEGHDFNPE WLRILTRMAD TKNNTLLFLY DDAQSIYQKK KALDFTLSSV GIKAQGRTTI
LNINYRNTQQ ILHFASSIAF NYLNNHIEDA LKYQQPDSGG IAGSYPELTC FDNQDEEITH
ILNWVIEQHQ RGIPWFDIAI LCPSTYSIKD VLGPQLTARN IPYQMIITSA DKKNWSPQQE
RLCVMPLPSS KGLEFQSVAV MDAARSKNEE DLSNDIKRLY VGFTRARYNL LITMNGENAL
SEHLLTTYKQ ITR
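
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is illustrative only and is not how the record was produced. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The function name `translate` is hypothetical.

```python
from itertools import product

# Codon table in TCAG order; table 11 shares these amino acid assignments
# with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence_text: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence_text.split()).upper()
    residues = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First row of the gene sequence in this record.
    first_row = "ATGGCAATCA TTATTCCCAC CGTAAGTAGC TGTAGCGAGA AAATTACAGC AGGCGAAAAA"
    print(translate(first_row))  # prints MAIIIPTVSSCSEKITAGEK
```

The output matches the first twenty residues of the protein sequence above. Passing the full gene sequence block to the same function gives the complete translation for comparison.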