Gene EcE24377A_1416 details


Gene Information

Locus tag: EcE24377A_1416
Symbol:
ID: 5589794
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 1419324
End bp: 1421795
Gene Length: 2472 bp
Protein Length: 823 aa
Translation table: 11
GC content: 49%
IMG OID: 640925111
Product: exonuclease family protein
Protein accession: YP_001462518
Protein GI: 157155450
COG category:
COG ID:
TIGRFAM ID:
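
The listed lengths can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Neither is stated in the record, but both are consistent with the values shown. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 1419324
end_bp = 1421795

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the record: 2472 bp and 823 aa.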


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000510818
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAAAG TCTTTATTTG CGCCGCCATT CCGGACGAAC AGGCAATAAA GGAAGAAGGT 
GCCGTCGCTG TAGCCACTGC CATTGAAGCC GGTGATGAAC GTCGCGCCCG CGCAAAATTT
CACTGGCAAT TCCTGGAACA TTATCCGGCT GCTCAGGACT GCGCTTATAA ATTTCTTGTC
TGCGAGGATA AACCCGGTAT ACCCCGCCCT GCCCTCGATT CCTGGGATGC TGAATATATG
CAGGAAAACC GCTGGGATGA GGAGTCTGCT TCCTTTGTCC CGGTTGAGAC TGAAGCCGAT
CCGATGAACG TCACTTTTGA CAAGCTGGCC CCTGAAGTAC AGAACGCTGT CATGGTTAAG
TTCGACACAT GTGAAAACAT CACCGTTGAT ATGGTTATTA GCGCACAGGA ATTGTTGCAG
GAAGACATGG CAACATTCGA CGGACATATC GTTGAAGCGT TGATGAAAAT GCCAGATGTT
AACGCCATGT ATCCGGAGCT TAAGCTGCAT GCCATCGGGT GGGTTAAGCA TAAATGTAAG
CCTGGTGCCA AATGGCCCGA AATTCAGGCA GAGATGCGCA TCTGGAAAAA ACGTCGCGAA
GGTGAACGCA AGGAAACCGG AAAATACACG TCTGTTGTTG ATCTCGCCCG CGCCAGAGTC
AATCAACAGC ACACTGAAAA TTCAACAGGA AAAATCAGCC TGGTCATTGC TGCCATTCAT
CGCGAATACA AGCAGACATG GAAAACACTG GATGACGAAC TGGCCTACGC TCTCTGGCCT
GGTGATGTGG ATGCCGGAAA CATTGACGGC AGCATCCATC GCTGGGCAAA AAATGAAGTT
ATCGACAACG ACCGCGAAGA CTGGAAGCGT ATCTCGGCAT CAATGCGCAA ACAGCCTGAT
GCCCTTCGCT ACGACCGCCA GACTATTTTT GGCCTTGTCC GTGAACGTCC GATCGACATT
CACAAAGACC CTGTGGCACT GAACAAATAC ATTACTGAAT ACCTGACTAC AAAGGGCGTG
TTTGAAGATG AAGGAAGAAA TCAGAGCGCA ACTGATACTC TCTCGTCGCC AGTACCAGAA
ACTGATGCAG TGGAAACGGC AATTCCGGAC AACGAAAAAA CCGAATGCAA AGTGGAAGTC
GAACCATCTG TAGAGCGTGA GGGGCCGTTC TACTTCCTCT TCACCGACAA GGATGGCGAA
AAATACGGTC GCGCAAACAA ACTTTCTGGT CTGGATAAGG CACTGGCTGC CGGGGCTACT
GAAATCACGA AAGAAGAATA TTTCGCCCAC AAAAACGGTA CATACTCAGG TTCACAACAA
AATACTGGTG CATCTGACAC GACCGCACAA CCAGGGCCGG TAAAAGTTAC CGCTGACGAA
GTAAACAAAA TTATGCAGGC AGCCAATATC AGCCAGCCTG ACGCCGATAA GTTGCTTGCT
GCCTCTCGCG GAGAATTTGT TGCAGGGATT AGCGACCCGA ATGATCCGAA ATGGGTTAAG
GGGATCCAGA CCCGCGATTC TGTAAACCAG AACCAGCATG AATCGGAACG GAACTACCAA
AAAGCGGAAC AAAACAGCCC AAATGCGTTA CAAAACGAGC CAGAAACGAA ACAGCCTGAA
CCAGTGGCGC AACAGGAAGT GGAAAAAGTC TGCACCGCCT GCGGTCAGAC CGGCGGCGGC
AATTGCCCTG ATTGTGGCGC GGTGATGGGC GACGCAACAT ACCAGGAAAC ATTCGATGAA
GAGTATCAGG TTGAAGTTCA GGAAGATGAT CCGGAGGAAA TGGAAGGCGC TGAACATCCA
CACAAGGAGA ACACTGGCGG CAATCAGCAT CACGATAGCG ATAATGAAAC TGGCGAGACG
GCAGATCACT CAATTAAGGT GAACGGTCAT CACGTAATCA CATCCACCAG CAGGACGTGT
GACCATCTAA TGATAGACCT TGAAACCATG GGAAAAAATC CTGATGCCCC GATCATCTCA
ATAGGTGCAA TATTTTTCGA TCCGCAAACC GGAGATATGG GACCGGAATT TAGTAAGACT
ATCGATCTGG AAACTGCTGG CGGAGTCATT GATCGGGACA CCATTAAATG GTGGCTTAAG
CAATCACGCG AAGCGCAATC TGCCATTATG ACCGATGAAA TCCCGTTAGA TGATGCACTG
TTACAATTGC GGGAATTTAT CGACGAAAAC TCCGGTGAAT TTTTTGTTCA GGTCTGGGGA
AATGGAGCCA ACTTCGACAA CACGATTTTG CGCCGTTCAT ACGAACGGCA GGGGATCCCC
TGCCCGTGGC GTTACTACAA CGATCGCGAT GTACGCACAA TCGTTGAGCT GGGGAAAGCC
ATAGACTTCG ATGCCAGAAC GGCTATTCCA TTCGAAGGTG AGCGCCATAA TGCACTTGAT
GACGCCCGTT ACCAGGCAAA ATACGTTTCA GTTATCTGGC AAAAACTGAT CCCGAGTCAG
GCTGATTTTT AA
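
The gene sequence is printed in blocks of 10 bases, so it has to be joined into one string before any analysis. The following is a minimal sketch of that step together with a simple GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the record does not say how its GC content figure was computed, so this is one plausible approach and not the database's own method.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments splits on any whitespace, including newlines.
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, used here only as a short demonstration.
first_line = "ATGAGTAAAG TCTTTATTTG CGCCGCCATT CCGGACGAAC AGGCAATAAA GGAAGAAGGT"
seq = clean_sequence(first_line)
print(len(seq), "bases, GC percent:", round(gc_fraction(seq) * 100))
```

Passing the full gene sequence text to `clean_sequence` and then to `gc_fraction` gives a value that can be compared with the GC content listed under Gene Information.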
 
Protein sequence
MSKVFICAAI PDEQAIKEEG AVAVATAIEA GDERRARAKF HWQFLEHYPA AQDCAYKFLV 
CEDKPGIPRP ALDSWDAEYM QENRWDEESA SFVPVETEAD PMNVTFDKLA PEVQNAVMVK
FDTCENITVD MVISAQELLQ EDMATFDGHI VEALMKMPDV NAMYPELKLH AIGWVKHKCK
PGAKWPEIQA EMRIWKKRRE GERKETGKYT SVVDLARARV NQQHTENSTG KISLVIAAIH
REYKQTWKTL DDELAYALWP GDVDAGNIDG SIHRWAKNEV IDNDREDWKR ISASMRKQPD
ALRYDRQTIF GLVRERPIDI HKDPVALNKY ITEYLTTKGV FEDEGRNQSA TDTLSSPVPE
TDAVETAIPD NEKTECKVEV EPSVEREGPF YFLFTDKDGE KYGRANKLSG LDKALAAGAT
EITKEEYFAH KNGTYSGSQQ NTGASDTTAQ PGPVKVTADE VNKIMQAANI SQPDADKLLA
ASRGEFVAGI SDPNDPKWVK GIQTRDSVNQ NQHESERNYQ KAEQNSPNAL QNEPETKQPE
PVAQQEVEKV CTACGQTGGG NCPDCGAVMG DATYQETFDE EYQVEVQEDD PEEMEGAEHP
HKENTGGNQH HDSDNETGET ADHSIKVNGH HVITSTSRTC DHLMIDLETM GKNPDAPIIS
IGAIFFDPQT GDMGPEFSKT IDLETAGGVI DRDTIKWWLK QSREAQSAIM TDEIPLDDAL
LQLREFIDEN SGEFFVQVWG NGANFDNTIL RRSYERQGIP CPWRYYNDRD VRTIVELGKA
IDFDARTAIP FEGERHNALD DARYQAKYVS VIWQKLIPSQ ADF
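
The protein sequence can be reproduced from the gene sequence by codon translation. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs from it in the set of permitted start codons. The sketch below builds that codon table and translates until the first stop codon. It assumes the sequence starts in frame and does not special-case alternative start codons, which is sufficient here because this gene begins with ATG. The function name `translate` is illustrative.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and the final 12 bases of the gene sequence.
print(translate("ATGAGTAAAG TCTTTATTTG CGCCGCCATT"))  # MSKVFICAAI
print(translate("GCTGATTTTT AA"))                     # ADF
```

The two outputs match the first ten and last three residues of the protein sequence listed above.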