Gene SbBS512_E2885 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E2885 
SymbolxseA 
ID6269432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp2683679 
End bp2685049 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content53% 
IMG OID641726827 
Productexodeoxyribonuclease VII large subunit 
Protein accessionYP_001881300 
Protein GI187731397 
COG category[L] Replication, recombination and repair 
COG ID[COG1570] Exonuclease VII, large subunit 
TIGRFAM ID[TIGR00237] exodeoxyribonuclease VII, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000000248025 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTACCTT CTCAATCCCC TGCAATTTTT ACCGTTAGTC GCCTGAATCA AACGGTTCGT 
CTGCTGCTTG AGCATGAGAT GGGACAGGTT TGGATCAGCG GCGAAATCTC TAATTTCACA
CAACCGGCTT CCGGTCACTG GTACTTTACA CTCAAAGACG ACACCGCCCA GGTACGCTGC
GCGATGTTCC GCAACAGCAA CCGCCGGGTG ACCTTCCGTC CACAGCATGG ACAACAAGTT
TTAGTTCGCG CCAATATTAC GCTCTACGAG CCGCGCGGTG ACTACCAGAT AATCGTTGAG
AGTATGCAGC CGGCCGGTGA AGGGCTGCTG CAACAGAAGT ACGAACAGCT CAAAGCGAAG
TTGCAGGCTG AAGGTTTGTT CGATCAGCAA TACAAAAAAC CACTTCCCTC CCCTGCGCAT
TGTGTTGGTG TGATCACCTC AAAAACCGGT GCTGCGCTAC ATGATATTTT GCATGTGTTA
AAACGTCGCG ATCCTTCTCT ACCGGTGATC ATCTACCCCA CCGCCGTTCA GGGCGATGAC
GCACCGGGGC AAATTGTTCG CGCCATTGAA CTGGCGAATC AGCGCAATGA GTGCGACGTG
TTGATCGTTG GGCGCGGCGG CGGTTCGCTG GAAGATTTAT GGAGTTTTAA CGACGAACGC
GTAGCGCGGG CGATTTTTGC CAGCCGCATT CCGGTCGTCA GCGCCGTCGG GCATGAGACG
GATGTGACCA TTGCCGATTT TGTTGCCGAT CTGCGTGCGC CAACGCCGTC TGCCGCCGCT
GAAGTAGTGA GCCGTAATCA GCAAGAGTTA CTGCGCCAGG TGCAATCGAC CCGTCAACGG
CTGGAGATGG CGATGGATTA TTATCTCGCC AACCGCACGC GTCGCTTTAC GCAGATCCAT
CACCGATTAC AGCAACAGCA TCCGCAGCTC CGGCTGGCAC GCCAGCAAAC CATGCTTGAG
CGCCTGCAAA AGCGGATGAG CTTTGCGCTG GAAAATCAAC TTAAGCGTAC CGGGCAACAG
CAGCAGCGGT TAACACAGCG GCTGAATCAG CAAAATCCAC AGCCGAAGAT TCATCGCGCG
CAAACGCGCA TTCAGCAACT GGAATATCGT TTAGCAGAAA CCCTGCGCGT ACAGCTTAGC
GCCACGCGTG AACGTTTCGG TAATGCAGTA ACGCACCTCG AAGCCGTAAG CCCACTGTCA
ACGCTGGCGC GTGGATACAG CGTTACTACT GCTACTGACG GCAATGTACT GAAAAAAGTG
AAGCAAGTTA AAGCGGGTGA AATGCTAACC ACACGTCTGG AAGACGGCTG GATAGAAAGT
GAAGTTAAAA ACATCCAGCC AGTAAAAAAA TCGCGTAAAA AGGTGCATTA A
 
Protein sequence
MLPSQSPAIF TVSRLNQTVR LLLEHEMGQV WISGEISNFT QPASGHWYFT LKDDTAQVRC 
AMFRNSNRRV TFRPQHGQQV LVRANITLYE PRGDYQIIVE SMQPAGEGLL QQKYEQLKAK
LQAEGLFDQQ YKKPLPSPAH CVGVITSKTG AALHDILHVL KRRDPSLPVI IYPTAVQGDD
APGQIVRAIE LANQRNECDV LIVGRGGGSL EDLWSFNDER VARAIFASRI PVVSAVGHET
DVTIADFVAD LRAPTPSAAA EVVSRNQQEL LRQVQSTRQR LEMAMDYYLA NRTRRFTQIH
HRLQQQHPQL RLARQQTMLE RLQKRMSFAL ENQLKRTGQQ QQRLTQRLNQ QNPQPKIHRA
QTRIQQLEYR LAETLRVQLS ATRERFGNAV THLEAVSPLS TLARGYSVTT ATDGNVLKKV
KQVKAGEMLT TRLEDGWIES EVKNIQPVKK SRKKVH