Gene EcE24377A_2214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_2214 
Symbol 
ID5588034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp2180371 
End bp2181519 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content42% 
IMG OID640925882 
Producthypothetical protein 
Protein accessionYP_001463282 
Protein GI157158614 
COG category[R] General function prediction only 
COG ID[COG5301] Phage-related tail fibre protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGAAA ATAAAATTTA CGCTGTGTTA ACTGATCGTG GCGCGCAGTT AGAAGCTGCG 
GCGCTGGCGT CAGGAGTGCC GGTACTGCTA AATAAATTCG TTATTGGTGA CGCGAACGGA
AACGACGACG TAACGCCAGA CCCGGCCCGA ACGGCATTAA TTCACGAGAC GTATCGCGGA
GATATTAAAT CGTCAGAAAA TAGCGGTAAT CAAGTCATTT TTACACTATA CGTACCACCG
GAAACCGGCG GCTATACTAT CCGTGAGGTG GGAATATTAA CCGACAAAGG TGAACTGTAC
TCTGTAGCGC GTTCGCCGGA TATTTTAAAA CCTACGGACA GTAACGGCGC ACTGATTTCA
ATCACGTATA AATACACCCT CGCGGTGTCC AGCACATCTA CTGTTAACGT AGTTATTGAT
AACAGTAGCG GAATGAACCA GGCAGATGCT GATAAGCGCT ATTTGCAGAT AAGCAAAAAT
TTATCTGAAA TTAAAAATAA GGGCGAATCC GCTCAACGAG CAGGGCGGGA GAATCTCGGT
ATTGATTTAG ACGATTATTA CGATAAAACT GAGATTGATA GCAAATTTAC TGATATTGAT
GAAGATATTA ATAATATAAA TAAAACAAAA CCTGTTCTTA CAGTAAATAA TATACAGCCT
GATGCTACTG GGAATGTAAA TACAGGTTCC GGATTTGCAA AACCAAACGG CGATGGGGCG
TTTAATCTGG TTATGTTATA CGGCGGTGAC CACGTCAGTG TTACTCCTGC GATGACAATC
GTAACAGGTT ATGATGTATC CCCGTACGCG ATAAATCCAA CCAATGTAAA CGCCGATGTG
GAAACTTATT TGTGCGGTGC GTGGATGACG TTAGCGGCAA CGAGTGGAAA TGCGTTAGTG
ATGGCGCAGC GAATTCCCAT TGGGAATATA TCAAAAATGT TAAATATACG AGATCCGTAT
TACCCGAATA AACACGGAAC CGTTAACTCA TATGATCACA ATTATATCTG CGTTAAATGT
AATATTGAAG GAATAGATAA TGACATCATA TTTACGTCAA ATCTGAAAGA CGTTGAGGAA
TATGGTGTGC AGGTTTTCCA GAACGCCAAA AATGGTATTT ACGGCACCGT TATAGACGAA
GGTCATTGA
 
Protein sequence
MAENKIYAVL TDRGAQLEAA ALASGVPVLL NKFVIGDANG NDDVTPDPAR TALIHETYRG 
DIKSSENSGN QVIFTLYVPP ETGGYTIREV GILTDKGELY SVARSPDILK PTDSNGALIS
ITYKYTLAVS STSTVNVVID NSSGMNQADA DKRYLQISKN LSEIKNKGES AQRAGRENLG
IDLDDYYDKT EIDSKFTDID EDINNINKTK PVLTVNNIQP DATGNVNTGS GFAKPNGDGA
FNLVMLYGGD HVSVTPAMTI VTGYDVSPYA INPTNVNADV ETYLCGAWMT LAATSGNALV
MAQRIPIGNI SKMLNIRDPY YPNKHGTVNS YDHNYICVKC NIEGIDNDII FTSNLKDVEE
YGVQVFQNAK NGIYGTVIDE GH