Gene EcSMS35_2306 details


Gene Information

Locus tag: EcSMS35_2306
Symbol: nfo
ID: 6145278
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2336551
End bp: 2337408
Gene Length: 858 bp
Protein Length: 285 aa
Translation table: 11
GC content: 52%
IMG OID: 641617180
Product: endonuclease IV
Protein accession: YP_001744353
Protein GI: 170681734
COG category: [L] Replication, recombination and repair
COG ID: [COG0648] Endonuclease IV
TIGRFAM ID: [TIGR00587] apurinic endonuclease (APN1)
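
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the last codon of the CDS is a stop codon. The function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the final codon is a stop and is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(2336551, 2337408)
print(length_bp)                  # 858
print(protein_length(length_bp))  # 285
```

Both results agree with the Gene Length (858 bp) and Protein Length (285 aa) fields of the record.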


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.0013882
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAATACA TTGGAGCGCA CGTTAGTGCT GCTGGCGGTC TGGCAAATGC CGCAATTCGC 
GCCGCCGAAA TCGACGCAAC CGCGTTTGCC TTGTTCACCA AAAACCAACG TCAGTGGCGT
GCCGCACCGC TCACGACGCA AACCATCGAT GAATTCAAAG CCGCCTGTGA AAAATATCAC
TACACATCGG CGCAAATTCT TCCCCACGAC AGTTATCTGA TTAACCTCGG ACATCCGGTC
ACTGAAGCTC TGGAAAAATC GCGCGATGCC TTTATAGATG AAATGCAGCG TTGCGAACAG
CTGGGGCTTT CTTTGCTCAA CTTCCACCCT GGCAGCCATC TGATGCAGAT TTCAGAAGAG
GATTGCCTTG CGCGTATTGC CGAATCCATC AACATTGCGC TGGATAAAAC TCATGGTGTG
ACAGCGGTGA TTGAAAACAC CGCCGGTCAG GGCAGTAACT TAGGGTTTAA ATTCGAACAT
CTCGCGGCGA TTATCGACGG CGTGGAAGAT AAATCCCGCG TCGGCGTCTG CATTGATACC
TGCCATGCTT TCGCTGCCGG GTATGATTTG CGTACTTCTG CCGAATGCGA GAAAACGTTC
GCGGATTTTG CCCGTATTGT CGGCTTTAAG TATCTGCGCG GGATGCACCT TAACGATGCG
AAAAGCACCT TTGGCAGCCG CGTTGACCGC CATCATAGCC TCGGTGAAGG CAATATCGGT
CATGATGCGT TCCGCTGGAT CATGCAGGAC GACCGTTTCG ACGGCATTCC GCTGATCCTT
GAAACCATCA ACCCGGATAT CTGGGCAGAA GAGATCGCCT GGCTGAAAGC GCAACAAACT
GAAAAAGCGG TAGCCTGA
 
Protein sequence
MKYIGAHVSA AGGLANAAIR AAEIDATAFA LFTKNQRQWR AAPLTTQTID EFKAACEKYH 
YTSAQILPHD SYLINLGHPV TEALEKSRDA FIDEMQRCEQ LGLSLLNFHP GSHLMQISEE
DCLARIAESI NIALDKTHGV TAVIENTAGQ GSNLGFKFEH LAAIIDGVED KSRVGVCIDT
CHAFAAGYDL RTSAECEKTF ADFARIVGFK YLRGMHLNDA KSTFGSRVDR HHSLGEGNIG
HDAFRWIMQD DRFDGIPLIL ETINPDIWAE EIAWLKAQQT EKAVA
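
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch under stated assumptions, not the method used by the source database: it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which this sketch ignores), and the helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
# Codon -> amino acid map, built in the conventional TCAG ordering.
# Assumption: amino acid assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGAAATACA TTGGAGCGCA CGTTAGTGCT"))  # MKYIGAHVSA
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would let a reader compare the results against the Protein sequence and GC content fields of the record.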