Gene EcE24377A_1049 details


Gene Information

Locus tag: EcE24377A_1049
Symbol:
ID: 5587806
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 1071486
End bp: 1073474
Gene Length: 1989 bp
Protein Length: 662 aa
Translation table: 11
GC content: 51%
IMG OID: 640924753
Product: bacteriophage Mu transposase MuA
Protein accession: YP_001462167
Protein GI: 157159157
COG category:
COG ID:
TIGRFAM ID:
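
The start and end coordinates, the gene length, and the protein length listed above are mutually consistent. A minimal sketch of that check follows, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1071486
end_bp = 1073474

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1989 0 662
```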


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGAAT GGTATACAGC AAAAGAGTTG CTCGGTTTGG CAGGTTTACC AAAGCAAGCC 
ACTAACATTA CACGTAAGGC ACAAAGAGAA GGCTGGGAGT TCAGGCAGGT TGCAGGGACT
AAAGGTGTAT CATTTGAATT CAATATCAAA TCATTCCCTG TCGCATTACG TGCTGAAATT
CTGTTGCAAC AAGGGAGAAT TGAAACAAGT CAGGGGTATT TTGAAATCGC CCGCCCCACG
CTGGAAGCCC ATGATTATGA TCGTGAGGCA CTGTGGAGCA AATGGGATAA CGCCAGCGAT
TCCCAGCGCA GACTTGCTGA AAAATGGTTG CCTGCGGTTC AGGCTGCAGA CGAAATGCTG
AACCAGGGGA TTTCAACAAA AACGGCTTTT GCGACCGTTG CTGGGCATTA CCAGGTCAGC
GCATCCACTT TGCGGGACAA GTATTACCAG GTACAGAAGT TTGCGAAGCC TGACTGGGCG
GCTGCGCTTG TTGATGGACG TGGTGCATCC CGTCGCAACG TTCACAAAAG TGAATTTGAC
GAGGATGCCT GGCAGTTTCT GATTGCAGAT TATCTGCGAC CGGAAAAGCC CGCCTTCCGC
AAATGTTATG AGCGTCTGGA ACTGGCAGCC CGCGAGCATG GCTGGAGTAT TCCCTCCCGT
GCCACGGCCT TTCGCCGGAT TCAGCAACTG GACGAGGCAA TGGTTGTTGC CTGTCGTGAA
GGTGAACATG CACTGATGCA TCTGATACCG GCACAGCAGC GAACTGTGGA ACACCTGGAC
GCCATGCAGT GGATCAACGG CGACGGTTAT CTGCATAACG TCTTTGTACG CTGGTTTAAC
GGTGATGTGA TCCGCCCGAA AACATGGTTC TGGCAGGATG TGAAAACCCG AAAAATTCTG
GGCTGGCGCT GCGATGTAAG CGAGAACATC GACTCAATTC GCCTCTCGTT TATGGATGTG
GTGACACGCT ACGGCATCCC GGAGGATTTT CACATCACCA TTGATAACAC CCGTGGTGCA
GCGAATAAAT GGCTGACGGG GGGCGCGCCC AATCGTTACC GCTTTAAGGT AAAAGAGGAC
GATCCAAAGG GACTGTTTTT ACTGATGGGC GCGAAAATGC ACTGGACAAG CGTTGTTGCC
GGTAAAGGCT GGGGCCAGGC AAAACCTGTT GAACGTGCTT TCGGTGTTGG TGGGCTTGAG
GAATACGTTG ATAAGCATCC GGCACTGGCT GGCGCATATA CGGGGCCAAA TCCGCAGGCA
AAACCTGATA ACTATGGCGA CCGCGCTGTT GATGCAGAGC TGTTTCTGAA AACCCTTGCC
GAAGGTGTGG CGATGTTCAA TGCCAGAACA GGCCGTGAAA CAGAAATGTG CGGAGGCAAA
CTTTCGTTTG ATGACGTTTT TGAGCGTGAA TACGCCAGAA CGATTGTGCG TAAGCCTACC
GAAGAGCAAA AACGGATGCT GTTACTGCCT GCCGAGGCGG TGAACGTTTC ACGTAAAGGC
GAGTTCGCGC TTAAAGTTGG CGGCTCCCTT AAAGGTGCGA AAAACGTTTA TTACAACATG
GCGTTAATGA ATGCCGGAGT GAAAAAAGTT GTGGTCAGAT TTGATCCACA GCAGTTACAC
AGCACGGTTT ATTGCTACAC CCTGGACGGT CGGTTTATCT GTGAAGCGGA ATGTCTGGCA
CCTGTTGCGT TTAATGATGC TGCGGCAGGC CGTGAATATC GCCGCCGCCA GAAACAACTG
AAATCTGCGA CGAAAGCAGC GATTAAGGCA CAAAAACAAA TGGATGCACT GGAAGTGGCA
GAGCTGCTGC CGCAGATAGC CGAACCTGAA GCACCAGAAT CACGAATTGT CGGCATTTTC
CGGCCTTCCG GTAATACGGA ACGGGTGAAG AATCAGGAGC GTGATGATGA ATACGAAACT
GAGCGTGATG AATATCTGAA TCATTCGCTG GATATTCTGG AACAGAACAG ACGTAAAAAA
GCCATTTAA
 
Protein sequence
MKEWYTAKEL LGLAGLPKQA TNITRKAQRE GWEFRQVAGT KGVSFEFNIK SFPVALRAEI 
LLQQGRIETS QGYFEIARPT LEAHDYDREA LWSKWDNASD SQRRLAEKWL PAVQAADEML
NQGISTKTAF ATVAGHYQVS ASTLRDKYYQ VQKFAKPDWA AALVDGRGAS RRNVHKSEFD
EDAWQFLIAD YLRPEKPAFR KCYERLELAA REHGWSIPSR ATAFRRIQQL DEAMVVACRE
GEHALMHLIP AQQRTVEHLD AMQWINGDGY LHNVFVRWFN GDVIRPKTWF WQDVKTRKIL
GWRCDVSENI DSIRLSFMDV VTRYGIPEDF HITIDNTRGA ANKWLTGGAP NRYRFKVKED
DPKGLFLLMG AKMHWTSVVA GKGWGQAKPV ERAFGVGGLE EYVDKHPALA GAYTGPNPQA
KPDNYGDRAV DAELFLKTLA EGVAMFNART GRETEMCGGK LSFDDVFERE YARTIVRKPT
EEQKRMLLLP AEAVNVSRKG EFALKVGGSL KGAKNVYYNM ALMNAGVKKV VVRFDPQQLH
STVYCYTLDG RFICEAECLA PVAFNDAAAG REYRRRQKQL KSATKAAIKA QKQMDALEVA
ELLPQIAEPE APESRIVGIF RPSGNTERVK NQERDDEYET ERDEYLNHSL DILEQNRRKK
AI
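
The protein sequence corresponds to a codon-by-codon translation of the gene sequence, and the GC content can be recomputed from the same bases. The sketch below is one way to do both, under stated assumptions: the helper names `clean`, `gc_percent`, and `translate` are hypothetical, the codon table is the standard one (translation table 11 assigns the same amino acids to codons and differs in which start codons it permits), and translation stops at the first stop codon. It does not handle alternative start codons, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq):
    """Remove the spaces and line breaks that group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(dna):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        a, b, c = (BASES.index(x) for x in dna[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * a + 4 * b + c]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGAAAGAAT GGTATACAGC AAAAGAGTTG"))  # prints: MKEWYTAKEL
print(translate("GCCATTTAA"))  # prints: AI
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison against the listed protein sequence and the listed GC content.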