Gene EcHS_A3572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3572 
Symbol 
ID5594551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3550377 
End bp3551681 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content53% 
IMG OID640922689 
Producthypothetical protein 
Protein accessionYP_001460170 
Protein GI157162852 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones60 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCTGT ATATTCAGAT TATCGTGGTG GCGTGCCTGA CGGGTATGAC ATCGCTTCTG 
GCGCATCGCT CGGCGGCTGT TTTTCATGAC GGCATCCGCC CGATCCTGCC GCAACTGATT
GAAGGCTATA TGAACCGTCG CGAGGCGGGG AGTATCGCTT TTGGTCTGAG CATTGGTTTT
GTGGCCTCGG TGGGGATCTC TTTTACCCTG AAAACCGGGC TGCTAAACGC ATGGTTACTC
TTTCTTCCTA CCGATATCCT CGGCGTACTG GCGATAAACA GCCTGATGGC GTTTGGTCTT
GGCGCTATCT GGGGCGTGTT GATCCTTACT TGCCTGTTGC CAGTAAACCA GCTGCTGACC
GCGCTGCCGG TGGATGTATT AGGTAGCCTC GGGGAATTAA GCTCGCCGGT GGTTTCTGCT
TTTGCACTCT TCCCGTTGGT GGCGATTTTC TACCAGTTTG GCTGGAAGCA AAGTCTGGTC
GCCGCCGTGG TTGTTCTGAT GACCCGTGTG GTAGTCGTGC GCTATTTCCC ACATCTTAAC
CCTGAATCCA TCGAAATCTT TATTGGCATG GTGATGCTGC TGGGAATCGC GATAACTCAC
GACCTGCGTC ATCGTGATGA AAATGACATC GATGCCAGCG GGCTTTCGGT GTTTGAAGAG
CGCACGTCGC GGATTATCAA AAACTTACCC TATATCGCCA TCGTGGGAGC ATTGATTGCC
GCCGTTGCCA GTATGAAGAT TTTCGCCGGC AGTGAAGTGT CGATCTTCAC TCTGGAGAAA
GCGTACTCCG CAGGCGTAAC GCCGGAACAA TCGCAAACGC TGATCAACCA GGCTGCTCTG
GCGGAGTTTA TGCGCGGACT GGGTTTTGTG CCGTTGATTG CCACCACCGC GTTAGCCACC
GGCGTATATG CAGTTGCGGG CTTTACCTTT GTTTATGCGG TGGGCTATCT CTCGCCGAAT
CCGATGGTTG CGGCGGTATT AGGCGCAGTG GTTATTTCGG CAGAAGTCCT GTTACTTCGT
TCGATCGGCA AATGGCTGGG GCGCTATCCG TCGGTGCGTA ATGCGTCGGA TAACATCCGT
AACGCCATGA ATATGCTGAT GGAAGTGGCG CTGCTGGTCG GTTCGATTTT CGCAGCAATT
AAAATGGCGG GTTATACCGG ATTCTCTATC GCGGTTGCCA TTTACTTCCT CAACGAATCC
CTGGGCCGTC CGGTACAGAA AATGGCGGCA CCGGTCGTGG CCGTAATGAT CACCGGTATT
CTGCTGAATG TTCTTTACTG GCTTGGCCTG TTCGTTCCGG CTTAA
 
Protein sequence
MDLYIQIIVV ACLTGMTSLL AHRSAAVFHD GIRPILPQLI EGYMNRREAG SIAFGLSIGF 
VASVGISFTL KTGLLNAWLL FLPTDILGVL AINSLMAFGL GAIWGVLILT CLLPVNQLLT
ALPVDVLGSL GELSSPVVSA FALFPLVAIF YQFGWKQSLV AAVVVLMTRV VVVRYFPHLN
PESIEIFIGM VMLLGIAITH DLRHRDENDI DASGLSVFEE RTSRIIKNLP YIAIVGALIA
AVASMKIFAG SEVSIFTLEK AYSAGVTPEQ SQTLINQAAL AEFMRGLGFV PLIATTALAT
GVYAVAGFTF VYAVGYLSPN PMVAAVLGAV VISAEVLLLR SIGKWLGRYP SVRNASDNIR
NAMNMLMEVA LLVGSIFAAI KMAGYTGFSI AVAIYFLNES LGRPVQKMAA PVVAVMITGI
LLNVLYWLGL FVPA