Gene EcE24377A_2230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_2230 
Symbol 
ID5589405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp2193020 
End bp2194984 
Gene Length1965 bp 
Protein Length654 aa 
Translation table11 
GC content53% 
IMG OID640925896 
Productphage terminase large subunit (GpA) 
Protein accessionYP_001463296 
Protein GI157155074 
COG category[R] General function prediction only 
COG ID[COG5525] Bacteriophage tail assembly protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTGGC AGACATACGA GTCGACTTCA GCGATTACGA TGTCAGCGAT ATCGAACGCG 
ATCTCGACAG GGTTGAATCC CTTGCGCGTG ACGATACCGA TGACGGGGGT TGAGTGGGCT
GATAAATATT TTTATCTCCC TGAGGGCTCC AGCCACATCG CTGGCCACTG GACGACTCAG
CCGGTCCAGG TAGTGATGCT CAATATGATG ACTAACGACG CGATAAAAAT CGTGTCTGTT
CGCAAATCAG CTCGTCTCGG TTATACAAAA ATACTCGTCG CGGCGCTGCT CTATTTCGCT
GAGCACAAAA AACGTAGTGC CGTGGTCTAT CAGCCTATCG ATGACGAATC GGATGGATTT
GTTGCCGACG AGGTTGACCC CGCTATCGCC GAAATGCCGG TGATTCAGAA AATTTTCCCC
GACTGGGATA AAAGCAACGA GCGTAACAAT CTCCAGCGTA AAGAAATGAG CGGCGCGATT
ATTGATTTTC GCGGCGCGAG TGCACCAGGA AATTTCCGGC GACTAACGAA ACAGGTTGTC
GAGGGTGACG AAGTTGACGG CTGGCCGCTG GAAGTTGCCA AAAAAGGCAA AGGCGAGGGC
TCGCCCATCG AACTGGCGCT CGTTCGAATT AAAGGGGCAG CGTACCCGAA GGCGATTTTC
GGCTCGACGC CAACCGTCAC CGGAAAAAGT CATATTGAAA TGCTGGAGGA TGCCGCTGAC
CTGACGTTTC GTTTTTACCT GAAATGTCCG CATTGTGGCG AGGAGCAGGT CCTGGTATTT
GGTTTCGACG GCATCGAATA TGGCCTCAAA TGGGATAATA GCCTGCAGAC AAATGAGGCG
AAATCGTCGT CCGCGTATTA CCAGTGCTGC CACTGTCCTG AGCATTTTTA CTATCGCGAT
CTCGAAAAAA TGGAGCTCGC GGGGCGCTGG ATAGCAGAGG ACTGCACCTG GACGCGGGAC
GGTATTCGTT TTTTTGATCA CGACGGTGGT GTCGTTCGCG CGCCGAAACA CGCGGCGATC
GTGATAAACG CCATGTATTC GCTGAATCTC GACGGCTGGG GCGAGATTGT CAGCGAGTGG
CTGAAAGCGA AGGGCGATCC GCTCAAAGAA AAGACGTTTC ATAACACGAC GCTCGGCGAA
CTCTGGAGTG ACGTAGCCAG CGAGCAGCTG GAGCACGATA TTCTGGTTAA TCGCCGGGAA
AAATACGCCA GCCAGGTTCC TGACGGTGTT GTTTATCTGA CCGGCGGCAT CGACTCTCAG
ACGTCCGGTC GCTACGAGTG TTACGTGTGG GGCTGGGGAG TGGAGGAGGA GTGCTGGCTG
ATTGATAAAA CAATCGTCCT CGGTCGCTAC GACGAGGAGG ACACGCTGCA GCGCGTCGAC
GGAGTGATTC GCAAACAATA CCGGCGCAGC GACGGGACCA CAATCGGCGT CAGTCGCTGG
GCGTGGGATA CCGGTGGTAT AGATGCGCAG GTCGTTTATA ACCGCTCGCT GAAACTCGGT
CCGCTGTGGG TCATTCCAAT TAAAGGTGCG AGTTCATACG GTCAGCCAGT CGTAAATATG
CCGCGTACAC GTAACGCGAA TAAAGTCTAT TTGTCGTTAA TCGGTACTGA TACGGCAAAA
GATTTGCTCG CAATGCGCCT GCCGCTGGAA CCCGATTCTA AATCGGCGAC ACCAAGTGCG
ATTCATTTTC CCAACGACGA CGAAATATTC GGCACGACAG AGGCAAAACA GCTCGTCTCT
GAAGTTCTGA TCCCGAAACT GATTAACGGT CGCGTCGTTT ATCGCTGGGA CAACCAGGGC
CGGCGAAATG AGGCGCTCGA CTGCTGGGTA TACGCGCTGG CAGCGCTACG TATCAGTAAA
ATTCGTTTCC AGCTCAATCT CGAGACGCTC GCTGAGCAAC GGAAAAAATC ACAAAACAAA
CTGTCTCTCG AGGAGATGGC CAGAATGCTC GGAGGGAGCT CATGA
 
Protein sequence
MNWQTYESTS AITMSAISNA ISTGLNPLRV TIPMTGVEWA DKYFYLPEGS SHIAGHWTTQ 
PVQVVMLNMM TNDAIKIVSV RKSARLGYTK ILVAALLYFA EHKKRSAVVY QPIDDESDGF
VADEVDPAIA EMPVIQKIFP DWDKSNERNN LQRKEMSGAI IDFRGASAPG NFRRLTKQVV
EGDEVDGWPL EVAKKGKGEG SPIELALVRI KGAAYPKAIF GSTPTVTGKS HIEMLEDAAD
LTFRFYLKCP HCGEEQVLVF GFDGIEYGLK WDNSLQTNEA KSSSAYYQCC HCPEHFYYRD
LEKMELAGRW IAEDCTWTRD GIRFFDHDGG VVRAPKHAAI VINAMYSLNL DGWGEIVSEW
LKAKGDPLKE KTFHNTTLGE LWSDVASEQL EHDILVNRRE KYASQVPDGV VYLTGGIDSQ
TSGRYECYVW GWGVEEECWL IDKTIVLGRY DEEDTLQRVD GVIRKQYRRS DGTTIGVSRW
AWDTGGIDAQ VVYNRSLKLG PLWVIPIKGA SSYGQPVVNM PRTRNANKVY LSLIGTDTAK
DLLAMRLPLE PDSKSATPSA IHFPNDDEIF GTTEAKQLVS EVLIPKLING RVVYRWDNQG
RRNEALDCWV YALAALRISK IRFQLNLETL AEQRKKSQNK LSLEEMARML GGSS