Gene EcSMS35_2658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2658 
SymbolxseA 
ID6143308 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2718529 
End bp2719896 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content54% 
IMG OID641617529 
Productexodeoxyribonuclease VII large subunit 
Protein accessionYP_001744694 
Protein GI170679853 
COG category[L] Replication, recombination and repair 
COG ID[COG1570] Exonuclease VII, large subunit 
TIGRFAM ID[TIGR00237] exodeoxyribonuclease VII, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000423152 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.673985 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTACCTT CTCAATCCCC TGCAATTTTT ACCGTTAGTC GCCTGAATCA AACGGTTCGT 
CTGCTGCTTG AGCATGAGAT GGGACAGGTG TGGATCAGCG GCGAAATCTC TAATTTCACA
CAACCAGCTT CCGGTCACTG GTATTTTACG CTCAAAGACG ACACCGCCCA GGTGCGCTGT
GCGATGTTCC GCAACAGCAA CCGTCGGGTG ACCTTTCGCC CGCAGCACGG ACAACAAGTT
TTAGTGCGCG CCAATATTAC ACTCTATGAA CCGCGCGGCG ATTATCAGAT TATAGTCGAG
AGTATGCAGC CTGCCGGTGA AGGGTTGCTG CAACAGAAGT ACGAACAGCT TAAAGCGAAG
TTGCAGGCTG AAGGTTTGTT CGATCTGCAA TACAAAAATT CACTCCCCTC CCCTGCGCAT
TGCGTTGGTG TGATCACCTC AAAAACCGGT GCTGCGCTAC ATGATATTTT GCATGTGTTA
AAACGTCGCG ATCCGTCTTT GCCGGTGATC ATCTACCCCA CCGCCGTTCA GGGCGATGAC
GCGCCGGGGC AAATTGTTCG CGCCATTGAA CTGGCGAATC AGCGCAACGA GTGTGACGTA
TTAATCGTCG GGCGCGGCGG CGGTTCGCTG GAAGATTTAT GGAGTTTTAA CGACGAACGC
GTAGCGCGGG CGATTTTTGC CAGCCGCATT CCGGTAGTGA GTGCCGTCGG GCATGAGACG
GATGTGACGA TTGCCGACTT TGTTGCTGAT CTGCGTGCGC CAACGCCGTC TGCCGCCGCT
GAAGTAGTGA GCCGCAATCA GCAAGAATTA CTGCGCCAGG TGCAATCGGC CCGTCAACGG
CTGGAGATGG CGATGGATTA TTATCTCGCC AACCGCACGC GTCGTTTTAC GCAGATCCAT
CATCGCTTGC AGCAGCAGCA TCCACAGCTC CGGCTGGCAC GCCAACAAAC CATGCTTGAA
CGCCTGCAAA AACGGATGAG CTTTGCGCTG GAAAATCAGC TTAAGCGTGC CGGGCAACAC
CAGCAGCGAT TAACACAGCG GCTGAATCAG CAAAATCCAC AGCCGAAGAT TCATCGCGCG
CAAACGCGCA TTCAGCAACT GGAATATCGT TTAGCAGAAA CCCTGCGCGC ACAGCTTAGC
GCCACGCGTG AACGTTTCGG TAATGCAGTA ACGCATCTCG AAGCGGTGAG TCCGCTGTCC
ACACTCGCTC GCGGTTATAG CGTTACCAGC GCCGCTGATG GCGCGGTGTT AAAACAGGTT
AAGCAGGTGA AAGTGGGTGA GACACTGACC ACTCGCCTGG GCGATGGCGT AGTGATCAGT
GAAGTGAGTG CGGTGACGAA AAGCCGCAAG CCACGTAAAA AAGCCTGA
 
Protein sequence
MLPSQSPAIF TVSRLNQTVR LLLEHEMGQV WISGEISNFT QPASGHWYFT LKDDTAQVRC 
AMFRNSNRRV TFRPQHGQQV LVRANITLYE PRGDYQIIVE SMQPAGEGLL QQKYEQLKAK
LQAEGLFDLQ YKNSLPSPAH CVGVITSKTG AALHDILHVL KRRDPSLPVI IYPTAVQGDD
APGQIVRAIE LANQRNECDV LIVGRGGGSL EDLWSFNDER VARAIFASRI PVVSAVGHET
DVTIADFVAD LRAPTPSAAA EVVSRNQQEL LRQVQSARQR LEMAMDYYLA NRTRRFTQIH
HRLQQQHPQL RLARQQTMLE RLQKRMSFAL ENQLKRAGQH QQRLTQRLNQ QNPQPKIHRA
QTRIQQLEYR LAETLRAQLS ATRERFGNAV THLEAVSPLS TLARGYSVTS AADGAVLKQV
KQVKVGETLT TRLGDGVVIS EVSAVTKSRK PRKKA