Gene EcHS_A4532 details


Gene Information

Locus tag: EcHS_A4532
Symbol:
ID: 5594340
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 4538288
End bp: 4540363
Gene Length: 2076 bp
Protein Length: 691 aa
Translation table: 11
GC content: 46%
IMG OID: 640923628
Product: ATPase
Protein accession: YP_001461068
Protein GI: 157163750
COG category: [V] Defense mechanisms
COG ID: [COG1401] GTPase subunit of restriction endonuclease
TIGRFAM ID:
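
The coordinate and length fields above are mutually consistent, which a short calculation makes explicit. The sketch below assumes the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


length = gene_length_bp(4538288, 4540363)
print(length)                     # 2076
print(protein_length_aa(length))  # 691
```

Both values agree with the Gene Length and Protein Length fields listed above.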


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value: 0.228072
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTCTTG TAGATAGTGT CGAAGCAGGT AGGCTAACGA TTAGTGAATT GATTGATGCC 
CTGGCGAAAG ATAAAAATTA CACAGCTTCC AGATGGTATC AGCGATACCG TGCATTTACG
ACTTTGCTAC AGCAAACCTC AACTTTTGCT GAGCCTGCAA CAGATGGTCT GGTCAAACAG
CTTTGGTATG AGCGTGACAA CGGCATTGCA AGTATTCGCC AGGGCGTTCC ATCCTTAGCA
GAATATCAGC AAAGCCTGCC ACTGCTTAGA GAACTAACTG AACGAATTCG GCAACAGCCG
GATGAAGAAA CTTACCAATA TGTTGGCAAT GCACTTCAAC AAGCTAAAGA AAACGGACTT
CTCAAGCGTA TGTATTGGAG TTTGAGAAAT CGCGTCTTTG CCGCGTTCTC GCCAGAAAAC
TACACCAGTA CTGTGGATGA GAATGCTTTT AATAAAGCAG CAGAATTCTT AAATCAGCAC
TTCCATCTCG GTTTGGTACT GACCGGAAAT TGGTTACAGA AAAACTATGA ATTGAAACAA
GCCATACACG CCCAATCTCC TGATACAGAT CCTTATTATG TGAATATGGC CATCTGGCAT
CTCTATGAAT TGCTCCGTGA ACGCGATAAT GAACAAAAGC AGGAGAAAGT AGCTAGCACT
ACATCCATAA CCCGCAGTGA GCCCATCGAG AACAAGATCA TCCTACATTC ACCAACTAAC
GTGATCTTCT TTGGCCCCCC TGGCACTGGC AAGACCTTCA GGTTGCAGCA AAAAATGAAA
GAGTACACTT CTCATGCTGT TCCCGCTGAT CGTGATGCCT GGCTGGATTC TCGCCTTGAA
TCGTTGAACT GGATGCAGGT TATAACGCTG GTGCTGCTCG ATCTTGGGAA ACGAGCGAAA
GTTCGCCAAA TTATTGAACA TATGTGGTTT CAACGTAAGG CATTATTAAA CGGTCGTAAT
GGCAATCTAT CGAATACTGC CTGGGCAGCT TTGCAATCCT ATACAGTTCC CGAGTCGTTA
ACCGTTGATT ATAAGAATCG GCGTGAGCCT GCCGTATTTA ACAAAACAGA TAACAGCGAA
TGGTTTCTAG TTGATTCACA GCTCGAGCAA GTGGAGGATT TGGTAGAGCT CTACGCCGAA
CTTAAACGTG GCCCTAAATC TGCCGAAGCC ATCCAGCGTT TTGCGGTGGT TACGTTCCAC
CAATCTTACG GCTATGAAGA ATTTATTGAA GGTATACGCG CTCGCTCTGA CGAGAGTGGC
AATATCTCTT ATCCCATTGA GCCGGGTATC TTTATGCGCC TTTGCCAACG TGCGAATGCC
GATCCAGGAC ATCGCTACGC CATTTTCATT GATGAGATCA ATCGCGGTAA CATATCCAAG
ATCTTTGGTG AACTAATCTC ACTCATTGAA GTAGACAAGC GTGCAGGCAT GCCCAATGCG
ATGAGCCTGC AACTGGCTTA TAGCGGTGAT CACTTCAGCG TACCCGGCAA TGTCGATATC
ATCGGAGCCA TGAATACAGC GGACCGTTCT TTAGCTCTGA TGGACACGGC TTTGCGCCGT
CGCTTTGACT TTGTCGAAAT GATGCCTGAT CTCTCTTTAC TGAGTGAAGC TAAGGTGAAA
GGCATAGAGC TCGAGTCGTT GTTAGAGAAA CTCAATAGCC GCATCGAGGC TCTTTACGAT
CGTGAACATA CGCTGGGGCA TGCGTTCTTT ATGCCGGTAA AAAATGCACT CGATGCCGGT
GATGAAGAAG CTGCGTTTAA ACAATTGAAG ATCGCATTCC AGAAAAAGAT CATTCCGCTT
TTACAGGAAT ACTTTTTCGA TGACTGGAAC AAGATCCGGT TGGTGCTGGC AGACAATCAA
AAGCAAGACG ACAACCTGCA ATTCGTGATT GAGAAAACCG ACGATCTCGA TACGCTTTTT
GGTAACAACC ATGGTTTACG ACGCCATGAT CAGCAATCAA CAGCTTATGA GCTCAAAGAT
TTCGATCAAG AGATCTGGAA TATTCCACAG GCTTATCGTT CAATTTATCA GCCCCAACAG
ACTCCCCTTG ATGAGCAGGC AGTAAATCAT GGGTGA
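
A GC content figure like the one in the Gene Information section can be recomputed from a sequence printed in 10-base blocks like the one above. This is a minimal sketch: the helper name is hypothetical, and how the source database counts and rounds its reported percentage is an assumption.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence, used here only as a small sample.
print(gc_percent("ATGACTCTTG"))  # 40.0
```

Passing the full gene sequence as one string, with the spaces and line breaks left in, gives the value for the whole gene.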
 
Protein sequence
MTLVDSVEAG RLTISELIDA LAKDKNYTAS RWYQRYRAFT TLLQQTSTFA EPATDGLVKQ 
LWYERDNGIA SIRQGVPSLA EYQQSLPLLR ELTERIRQQP DEETYQYVGN ALQQAKENGL
LKRMYWSLRN RVFAAFSPEN YTSTVDENAF NKAAEFLNQH FHLGLVLTGN WLQKNYELKQ
AIHAQSPDTD PYYVNMAIWH LYELLRERDN EQKQEKVAST TSITRSEPIE NKIILHSPTN
VIFFGPPGTG KTFRLQQKMK EYTSHAVPAD RDAWLDSRLE SLNWMQVITL VLLDLGKRAK
VRQIIEHMWF QRKALLNGRN GNLSNTAWAA LQSYTVPESL TVDYKNRREP AVFNKTDNSE
WFLVDSQLEQ VEDLVELYAE LKRGPKSAEA IQRFAVVTFH QSYGYEEFIE GIRARSDESG
NISYPIEPGI FMRLCQRANA DPGHRYAIFI DEINRGNISK IFGELISLIE VDKRAGMPNA
MSLQLAYSGD HFSVPGNVDI IGAMNTADRS LALMDTALRR RFDFVEMMPD LSLLSEAKVK
GIELESLLEK LNSRIEALYD REHTLGHAFF MPVKNALDAG DEEAAFKQLK IAFQKKIIPL
LQEYFFDDWN KIRLVLADNQ KQDDNLQFVI EKTDDLDTLF GNNHGLRRHD QQSTAYELKD
FDQEIWNIPQ AYRSIYQPQQ TPLDEQAVNH G
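
The protein sequence can be checked against the gene sequence by translating it with translation table 11. The sketch below builds the codon table from the NCBI ordering of bases (TCAG) and amino acids; the function name is illustrative. It does not handle the alternative start codons that table 11 allows, which is sufficient here only because this gene begins with ATG.

```python
from itertools import product

# Amino acids in NCBI order for codons enumerated over T, C, A, G.
# Table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(sequence: str) -> str:
    """Translate a CDS up to the first stop codon, ignoring whitespace."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGACTCTTG TAGATAGTGT CGAAGCAGGT AGGCTAACGA TTAGTGAATT GATTGATGCC"
print(translate_cds(first_line))  # MTLVDSVEAGRLTISELIDA
```

The output matches the first 20 residues of the protein sequence above (MTLVDSVEAG RLTISELIDA).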