Gene SeD_A3772 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3772 
Symbol 
ID6871471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3616479 
End bp3617603 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content53% 
IMG OID642786741 
ProductDNA protecting protein DprA 
Protein accessionYP_002217369 
Protein GI198243264 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCGTA CCGAAATTTG GTTACGTTTA ATGTACGTCG GCGACCTTTA TGGCGAGGCG 
ATGTTGAACA TGGCGAATTC GCTTATTCGC CAGCCTCAGA TAAATCGCAC GCACCTTCAG
GAAGCTGGTC TTACCGCGCG GCAAGCTGAA CGTTTTTTAC AGCTTCCAGC AGGTGTGCTT
GATGAGACGT TACGCTGGCT TGAACTACCG CAGTACCATT TTCTGTGTGC GGATAGTGAA
ATTTATCCTC CCCAACTGCG TGCTATTGAC GATTATCCCG GCGCTATTTT TATTGACGGC
GATCCTGCCT GTCTGCATAC CTGCCAACTT GCCGTCGTAG GGAGCCGGAG CCACTCTTGG
TATGGGGAAC GTTGGGGACG TCTGCTTTGC GAAAGCCTTG CGAAAAGCGG TTTGACGATC
ACCAGCGGCC TTGCCCGGGG AATTGATGGC GTAGCACACA ACGCCGCGAT GAGTATGGGG
GGAAAAAGTG TAGCGGTGTT AGGAAATGGT TTGGCAAAGA TTTATCCTCG CCGACATGCC
GTGCTGGCTG AAAACTTGAT TGCCACCGGC GGCGCAGTGG TCTCAGAATT TCCGCTTTCA
ACGCCTCCGC TACCGCAACA TTTTCCTCGC AGAAACCGTA TCATCAGCGG ACTGAGCAAA
GGCGTGCTGG TAATCGAGGC GGCGTTGCGC AGCGGTTCGT TGGTGACGGC GCGCTGCGCG
CTTGAACAGG GACGGGACGT TTTTGCATTA CCAGGTCCTA TCGGTAGTCC GGGAAGCGAA
GGCACGCACT GGTTAATTAA ACAAGGGGCT ACGCTTGTGA CGACGCCGGA GGATATTCTG
GAAAATTTGC AATACGGCTT ACACTGGTTG CCAACTACGG CGGAAAATTC ACTTTATTCA
CTAAATCAGG ATGAGGCGGC ATTGCCATTT CCTGAGCTCC TGGCTAACGT AGGAGATGAG
GTAACACCTG TTGACGTCGT CGCTGAACGT GCCGGCCAAC CTGTGCCAGC GGTAGTGGCT
CAGCTACTCG AACTGGAGTT AGCAGGATGG ATCGCAGCTG TACCCGGCGG CTATGTCCGA
TTAAGGAGGG CAAGCCATGT TCGACGTACT AATGTATTTG TTTGA
 
Protein sequence
MARTEIWLRL MYVGDLYGEA MLNMANSLIR QPQINRTHLQ EAGLTARQAE RFLQLPAGVL 
DETLRWLELP QYHFLCADSE IYPPQLRAID DYPGAIFIDG DPACLHTCQL AVVGSRSHSW
YGERWGRLLC ESLAKSGLTI TSGLARGIDG VAHNAAMSMG GKSVAVLGNG LAKIYPRRHA
VLAENLIATG GAVVSEFPLS TPPLPQHFPR RNRIISGLSK GVLVIEAALR SGSLVTARCA
LEQGRDVFAL PGPIGSPGSE GTHWLIKQGA TLVTTPEDIL ENLQYGLHWL PTTAENSLYS
LNQDEAALPF PELLANVGDE VTPVDVVAER AGQPVPAVVA QLLELELAGW IAAVPGGYVR
LRRASHVRRT NVFV