Gene SbBS512_E3670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E3670 
SymboldprA 
ID6272779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3411989 
End bp3413113 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content51% 
IMG OID641727534 
ProductDNA protecting protein DprA 
Protein accessionYP_001881969 
Protein GI187731689 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.206561 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCGATA CAGATATTTG GCTGCGTTTA ATGAGCATCA GCAGCTTGTA CGGCGATGAT 
ATGGTCCGTA TAGCTCACTG GCTGGCAAAA CAGTCGCATA TTGATGCGGT TGTATTGCAG
CAAACAGGGC TTACATTGCG GCAGGCACAA CGCTTTCTTT CATTTCCGCG AAAGAGTATC
GAAAGCTCAC TTTGTTGGTT GGAGCAACCC AATCATCATT TAATCCCTGC GGACAGCGAA
TTTTATCCTC CTCAACTTCT GGCGACGACA GATTACCCCG GCGCACTGTT TGTTGAAGGA
GAACTGCACG CGCTGCATTC ATTTCAGCTT GCCGTAGTGG GGAGTCGGGC GCATTCATGG
TATGGCGAGC GATGGGGACG ATTATTTTGC GAAACTCTGG CCACGCGTGG AGTGACAATT
ACGAGTGGAC TGGCGCGTGG AATCGATGGT GTGGCGCATA AAGCGGCCTT ACAGGTAAAT
GGCGTCAGCA TTGCTGTATT GGGGAACGGA CTTAATACCA TTCATCCGCG CCGCCATGCC
CGACTGGCTA CCAGTTTGCT TGAACATGGT GGGGCACTTG TCTCGGAATT TCCCCTCGAT
GTTCCACCCC TTGCTTACAA TTTCCCACGA AGAAATCGCA TTATCAGTGG TCTAAGTAAA
GGTGTACTGG TGGTGGAAGC GGCTTTGCGC AGTGGTTCGT TGGTGACAGC ACGTTGTGCG
CTTGAGCAGG GGCGTGAAGT TTTTGCCTTG CCAGGACCAA TAGGGAATCC GGGAAGCGAA
GGGCCTCACT GGTTAATAAA ACAAGGTGCG ATTCTTGTGA CGGAACCGGA AGAAATTCTG
GAAAACTTGC AATTTGGATT GCACTGGTTG CCAGACGCCC CTGAAAATTC ATTTTATTCA
CCAGATCAGC AAGACGTGGC ATTGCCATTT CCTGAGCTCC TGGCTAACGT AGGAGATGAG
GTAACACCTG TTGACGTCGT CGCTGAACGT GCCGGCCAAC CTGTGCCAGA GGTAGTTACT
CAACTACTCG AACTGGAGTT AGCAGGATGG ATCGCAGCTG TACCCGGCGG CTATGTCCGA
TTGAGGAGGG CATGCCATGT TCGACGTACT AATGTATTTG TTTGA
 
Protein sequence
MVDTDIWLRL MSISSLYGDD MVRIAHWLAK QSHIDAVVLQ QTGLTLRQAQ RFLSFPRKSI 
ESSLCWLEQP NHHLIPADSE FYPPQLLATT DYPGALFVEG ELHALHSFQL AVVGSRAHSW
YGERWGRLFC ETLATRGVTI TSGLARGIDG VAHKAALQVN GVSIAVLGNG LNTIHPRRHA
RLATSLLEHG GALVSEFPLD VPPLAYNFPR RNRIISGLSK GVLVVEAALR SGSLVTARCA
LEQGREVFAL PGPIGNPGSE GPHWLIKQGA ILVTEPEEIL ENLQFGLHWL PDAPENSFYS
PDQQDVALPF PELLANVGDE VTPVDVVAER AGQPVPEVVT QLLELELAGW IAAVPGGYVR
LRRACHVRRT NVFV