Gene EcE24377A_3331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_3331 
Symbol 
ID5588684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp3346389 
End bp3348446 
Gene Length2058 bp 
Protein Length685 aa 
Translation table11 
GC content45% 
IMG OID640926965 
ProductATPase 
Protein accessionYP_001464336 
Protein GI157155277 
COG category[V] Defense mechanisms 
COG ID[COG1401] GTPase subunit of restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCTTG TAGATAGTGT CGAAGCAGGC AAGCTGACGA TCAGAGAATT GATTGATGCC 
CTGGTGAAAG ATAAAAATTA CACAGCTTCC AAGTGGGAGC AGCGATACCG GGAATTTACG
ACCTTGCTAC AACAAACCTC AACTTTTGCT GAGCCTGAAA CGGATGATTT AGTCAAGAGG
CTCTGGTATG AACGTGATAA CGGTATTGCC AGTATTCGTC AGGGCGTTCC ATCATTAGCA
GAATATCAGC AAAGCCTCCC ATTGCTTAGA GAACTCACCG AACGTGTTCG TCAACAACCA
AATGAAGTTA CCTACCAATA CGTAGGCACT GCACTGCAGC AGGCCAAAAA GATCGGACAG
CTTAAACGCA TGTATTGGAG TTTAAGAAAT CGTGTCTTTG CCGCGTTCTC GCCAGAAAAC
TACACCAGTA CTGTGGATGA GAATGCTTTC AATAAAACAG CAGAATTTCT AAATCAACGC
TTCCATCTCG GTTTGGTACT GACTGGAAAT TGGTTACAGA AAAACTATGA ATTGAAACAA
GCCATACACG CCCAATCTCC TGACACAGAT CCCTATTACG TGAATATGGC TATCTGGCAT
ATCTATGAAT TGCTCCGTGA GCGCGATAAT GAACAGAAGC AAGAGAAAGT AGCAAGCAGT
GAGCCCATCG AGAACAAGAC TATTCCACAT TCACCAACTA ACGTGATCTT CTTTGGCCCT
CCAGGCACTG GCAAGACCTT CACGTTGCAG CAAAAAATGA AAGAGTATAC CTCTCATGCA
GTTCCTGCTG ATCGTGAAGC CTGGCTGGAT TCTCGTCTTG AATCGTTGAA CTGGATGCAG
GTTATAACGC TGGTGCTGCT CGATCTTGGG AAACTAGCCA AAGTTCGCCA AATTATTGAA
CATATGTGGT TTCAACGTAA GGCATTATTA AACGGTCGTA ATGGCAATCT ATCGAATACT
GCCTGGAAGG CTTTGCAAGC CTATACAATT CCCGAATCGT TAACTGTTGA TTATAAAAAT
CGGCGTGAGC CTGCCGTATT TGACAAAACG GATAACAGCG AATGGTTTCT AGTTGATTCA
CAGCTCGAGC AAGTGGAGGA TTTGTTAGCG CTCTACGCCG AACTTAAACG TGGTCCAAAA
TCTGCCGAAG CCATCCAGCG TTTTGCGGTT GTTACGTTCC ACCAATCTTA CGGCTATGAA
GAATTTATTG AAGGTATGCG TGCACGCTCT GACGAGAGTG GCAATATATC TTATCCCATT
GAGCCGGGTA TTTTTATGCG CCTTTGCCAA CGTGCTAATG CCGATCCAGC ACATCGCTAC
GCCATTTTCA TTGATGAGAT CAATCGCGGA AACATATCCA AGATCTTTGG AGAACTAATC
TCACTCATTG AAGTAGACAA GCGTGCGGGT ATGCCCAATG CAATGAGTCT GCAACTGTCT
TATAGCGGTG ATTACTTTAG CGTGCCCGCC AATGTCGACA TCATCGGAGC CATGAATACA
GCAGACCGTT CTTTAGCTCT GCTGGACACG GCTTTGCGCC GTCGCTTTGA CTTTGTCGAA
ATGATGCCAG ATCTTTCTTT ACTGAGTGGA GCTAAGGTGA AAGGCATAGA GCTTGAGTCG
TTGTTAGAAA AACTCAATAG CCGCATCGAA GCTCTTTACG ATCGTGAGCA TACACTAGGA
CATGCATTCT TTATGCCGGT AAAAAATGCT CTCGATACCG GTGATGAAGA AACTGCGTTT
AAACAATTAA AGATCGCATT TCAGAAAAAG ATCATTCCGC TTTTACAGGA ATACTTTTTC
GATGACTGGA ACAAGATCCG GTTGGTGCTG GCAGACAATC AAAAGCAAGA CGACAACCAG
CAATTCGTGA TTGAGAAAAC TGACGATCTC GATACGCTTT TTGGTAACAA CCATGGTTTA
CGACGCCATG ATCAGCAATC AACAACTTAC GAGCTCAAAG ATCTCGATCA AGGGGTCTGG
AATATGCCAA AGGCTTATCG TTCAATTTAT CAGCCACAGC GGACCACTCC TGATGAGCAG
GTCGTAAACC ATGAGTGA
 
Protein sequence
MDLVDSVEAG KLTIRELIDA LVKDKNYTAS KWEQRYREFT TLLQQTSTFA EPETDDLVKR 
LWYERDNGIA SIRQGVPSLA EYQQSLPLLR ELTERVRQQP NEVTYQYVGT ALQQAKKIGQ
LKRMYWSLRN RVFAAFSPEN YTSTVDENAF NKTAEFLNQR FHLGLVLTGN WLQKNYELKQ
AIHAQSPDTD PYYVNMAIWH IYELLRERDN EQKQEKVASS EPIENKTIPH SPTNVIFFGP
PGTGKTFTLQ QKMKEYTSHA VPADREAWLD SRLESLNWMQ VITLVLLDLG KLAKVRQIIE
HMWFQRKALL NGRNGNLSNT AWKALQAYTI PESLTVDYKN RREPAVFDKT DNSEWFLVDS
QLEQVEDLLA LYAELKRGPK SAEAIQRFAV VTFHQSYGYE EFIEGMRARS DESGNISYPI
EPGIFMRLCQ RANADPAHRY AIFIDEINRG NISKIFGELI SLIEVDKRAG MPNAMSLQLS
YSGDYFSVPA NVDIIGAMNT ADRSLALLDT ALRRRFDFVE MMPDLSLLSG AKVKGIELES
LLEKLNSRIE ALYDREHTLG HAFFMPVKNA LDTGDEETAF KQLKIAFQKK IIPLLQEYFF
DDWNKIRLVL ADNQKQDDNQ QFVIEKTDDL DTLFGNNHGL RRHDQQSTTY ELKDLDQGVW
NMPKAYRSIY QPQRTTPDEQ VVNHE