Gene EcHS_A0233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0233 
Symbol 
ID5592000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp252272 
End bp253552 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content57% 
IMG OID640919420 
Producthypothetical protein 
Protein accessionYP_001457007 
Protein GI157159689 
COG category[T] Signal transduction mechanisms 
COG ID[COG3456] Uncharacterized conserved protein, contains FHA domain 
TIGRFAM ID[TIGR03354] type VI secretion system FHA domain protein 


Plasmid Coverage information

Num covering plasmid clones65 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGAAG AGAAACTACA GACGCTTTCA TTGCAGGTCA TTAACGGCAG TGAGCTGGAA 
AGCGGACGGG CGGCGCGCTG TCTGTTCACA CAGCAGGGAA ATGTCGGCCA TGGCCCCGAA
TGCCACTGGT CGGTACAGGA TCGTCAGCAG AGCATTCCGG CCCAGGCTTT TACCGTTATC
CTGCACGATG GCACATTTTG TCTGCGCCCG CAGACGGCAC AACTGTGGCT GAATCAGGCA
AAAGTCACAG CAACATCAGA CCTAATACAG TTGCGCCAGG GCGATGAGAT CCAGATCGGA
CGGCTGATGG TGAGGGTTCA TCTGAACCGG GGAGATATTC CCCATTACGA TGAGGAAATG
GCCACTCCCG AAACCATCGT TACCAATCGC GATATGCTCA CGGATACCCT GCTATCAACG
GAGGGTACGC CACACTATCC GGGAATGACT CACCGGCACC AGCTTGCAGA CACCGTGGTA
AACGGTTTTT CTGCCGATCC ACTCCAGGCA CTTCAGTCCG AAAGCCTGAT TACCACGGGC
GATCCGCTTT CAGGCATTGC GGCTGTCCGG CCATCGGCAC CGCTGTCCGA TCCGGCAAGT
AATGGGGGGA TCAATACTCC GTTTATGGAT CTGCCGCCCA TTTATGCCAG CCCCGGCGAT
CGTAATGACG ACGTCTCTGC GGCAGAAATG GCGCAACGCC ACCTTGCGGT CACCCCCTTA
CTGCGCGGTC TTGGCGGCTC GCTTACCATG AGCAATTCCG ACGATGCGGA TGATTTTCTG
GAGGAGGCCG GACGAACGTT ACAGGCCGCA ATAAAAGGTC TGCTCGATTT GCAGCAGCAG
CGTAACAGCC TCTCAGACAA ACATTTGCGC CCGCTGGAAG ATAACCCGCT GCGCCTGAAC
ATGGATTACG CCACCGCGCT CGACGTAATG TTTGCCGAAG GTAAAAGCCC GGTACATCTG
GCGGCTCCCG CCGCCGTCAG TGAAAGCCTG CGCAATGTCC GCCACCACGA AGAAGCTAAC
CGGGCAGCGA TTGTGGAGTC GCTTCGTGTC CTGCTGGATG CTTTCTCACC ACAAAATCTG
CTGCGCCGCT TTGTGCAGTA CCGCCGCAGC CATGAACTGC GCCAGCCGCT GGATGATGCC
GGAGCATGGC AAATGTACAG CCATTATTAC GAAGAACTGG CCTCCGATCG CCAGCAGGGG
TTTGAGATGC TGTTTAACGA GGTCTACGCC CAAGTCTATG ACCGGGTGCT TCGTGAAAAA
CAGCGGGAGC CGGAAGCATG A
 
Protein sequence
MPEEKLQTLS LQVINGSELE SGRAARCLFT QQGNVGHGPE CHWSVQDRQQ SIPAQAFTVI 
LHDGTFCLRP QTAQLWLNQA KVTATSDLIQ LRQGDEIQIG RLMVRVHLNR GDIPHYDEEM
ATPETIVTNR DMLTDTLLST EGTPHYPGMT HRHQLADTVV NGFSADPLQA LQSESLITTG
DPLSGIAAVR PSAPLSDPAS NGGINTPFMD LPPIYASPGD RNDDVSAAEM AQRHLAVTPL
LRGLGGSLTM SNSDDADDFL EEAGRTLQAA IKGLLDLQQQ RNSLSDKHLR PLEDNPLRLN
MDYATALDVM FAEGKSPVHL AAPAAVSESL RNVRHHEEAN RAAIVESLRV LLDAFSPQNL
LRRFVQYRRS HELRQPLDDA GAWQMYSHYY EELASDRQQG FEMLFNEVYA QVYDRVLREK
QREPEA