Gene EcHS_A4349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4349 
Symbol 
ID5591006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4351156 
End bp4353384 
Gene Length2229 bp 
Protein Length742 aa 
Translation table11 
GC content51% 
IMG OID640923447 
Producthypothetical protein 
Protein accessionYP_001460892 
Protein GI157163574 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones55 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACACAC AGACCCTGTA TGAGTTAAGT CAGGAGGCTG AACGCCTGTT ACAGCTTTCT 
CGCCAACAGT TGCAGTTACT GGAAAAAATG CCTCTCTCTG TACCCGGAGA CGACGCGCCA
CAACTGGCTT TACCCTGGAG TCAGCCTAAT ATCGCCGAAC GTCACGCGAT GCTGAATAAT
GAGTTGCGTA AAATTTCCCG ACTGGAAATG GTGCTGGCAA TTGTCGGTAC CATGAAAGCA
GGGAAATCAA CCACCATTAA TGCCATTGTT GGTACGGAGG TTCTGCCTAA TCGTAATCGC
CCAATGACTG CGCTGCCGAC GCTTATTCGC CATACGCCCG GGCAAAAGGA ACCGGTACTG
CATTTTTCAC ATGTCGCGCC AATCGATTGT TTAATTCAAC AATTACAACA GCGCCTGCGT
GATTGCGATA TTAAGCATCT GACCGATGTG CTGGAAATAG ATAAAGATAT GCGTGCGCTT
ATGCAGCGGA TCGAAAATGG CGTCGCTTTC GAAAAATATT ATCTGGGTGC CCAGCCTATT
TTTCATTGTC TGAAAAGTTT GAATGATTTA GTGCGACTGG CGAAGGCGCT GGACGTCGAT
TTTCCTTTTT CTGCTTACGC CGCCATTGAG CATATTCCCG TGATTGAAGT GGAGTTTGTC
CATCTGGCGG GGCTGGAGAG TTATCCCGGT CAGTTGACGT TACTGGATAC CCCCGGGCCA
AATGAAGCCG GGCAACCGCA TCTGCAAAAA ATGCTTAACC AGCAGCTGGC ACGCGCCTCG
GCGGTACTGG CGGTGCTGGA TTATACGCAA CTGAAATCGA TCTCCGATGA AGAGGTCCGT
GAGGCGATTT TGGCGGTGGG GCAATCGGTG CCGCTGTATG TGCTGGTCAA TAAGTTCGAT
CAACAGGATC GTAACAGTGA CGACGCCGAC CAGGTGCGGG CACTGATTTC CGGGACGCTG
ATGAAAGGCT GTATTACGCC ACAGCAGATA TTTCCGGTGT CGTCGATGTG GGGCTACCTG
GCGAATCGAG CGCGCCATGA GTTAGCCAAC AACGGTAAGT TACCAGCGCC AGAGCAACAA
CGCTGGGTGG AAGATTTTGC CCATGCTGCG CTCGGCAGGC GCTGGCGTCA TGACGATCTG
GCGGACCTCG AACATATTCG TCATGCTGCC GATCAGTTGT GGGAAGATTC GCTGTTCGCC
CAGCCAATTC AGGCGTTGCT TCATGCCGCT TACGCTAACG CCTCGTTGTA TGCTCTGCGA
TCTGCTGCGC ATAAACTGTT GAATTACGCG CAGCAGGCGC GGGGATACCT GGATTTTCGT
GCGCACGGGT TAAACGTCGC TTGTGAACAA TTGCGGCAAA ATATCCATCA GGTCGAAGAA
AGTTTGCAGC TATTGCAACT CAATCAGGCG CAGGTGAGCG GCGAGGTTAA ACATGAAATC
GAGCTGGCCC TGACCTCCGC CAACCACTTT CTGCGTCAAC AGCAAGATGC GCTGAATGCG
CAGTTAGCCG CCTTGTTTCA GGATGATTCG GAGCCGTTAA GCGAGATTCG TACCTGCTGT
GAGACACTGT TACAGACGGC GCAGAACACC ATCAGTCGCG ACTTTACGCT GCGTTTTGCC
GAGCTTGAAT CCACCCTTTG CCGGGTGTTA ACCGATGTTA TTCGGCCCAT TGAGCAACAA
GTCAAAATGG AATTGAGCGA GTCAGGGTTT CGTCCTGGGT TTCATTTTCC TGTTTTTCAC
GGCGTAGTTC CCCACTTCAA CACTCGCCAG CTGTTCAGTG AAGTCATTTC GCGCCAGGAT
GCAACGGACG AGCAGAGCAC GCGTTTAGGC GTTGTGCGTG AGACTTTTTC GCGCTGGTTG
AATCAGCCCG ACTGGGGACG GGGAAATGAG AAATCCCCGA CAGAAACGGT TGATTACAGT
GTGTTGCAAC GAGCTTTAAG CGCAGAAGTC GATCTTTATT GCCAACAAAT GGCTAAAGTT
CTGGCAGAGC AGGTCGATGA ATCTGTTACG GCAGGCATGA ATACTTTTTT CGCTGAGTTC
GCTTCATGTT TGACGGAATT ACAGACGCGT TTACGCGAAA GTCTGGCTCT GCGTCAACAA
AATGAATCGG TGGTCAGGCT GATGCAGCAG CAATTGCAGC AGACTGTGAT GACTCACGGC
TGGATTTACA CCGACGCCCA GCTGTTACGC GATGATATTC AAACACTTTT CACGGCAGAA
CGATATTGA
 
Protein sequence
MYTQTLYELS QEAERLLQLS RQQLQLLEKM PLSVPGDDAP QLALPWSQPN IAERHAMLNN 
ELRKISRLEM VLAIVGTMKA GKSTTINAIV GTEVLPNRNR PMTALPTLIR HTPGQKEPVL
HFSHVAPIDC LIQQLQQRLR DCDIKHLTDV LEIDKDMRAL MQRIENGVAF EKYYLGAQPI
FHCLKSLNDL VRLAKALDVD FPFSAYAAIE HIPVIEVEFV HLAGLESYPG QLTLLDTPGP
NEAGQPHLQK MLNQQLARAS AVLAVLDYTQ LKSISDEEVR EAILAVGQSV PLYVLVNKFD
QQDRNSDDAD QVRALISGTL MKGCITPQQI FPVSSMWGYL ANRARHELAN NGKLPAPEQQ
RWVEDFAHAA LGRRWRHDDL ADLEHIRHAA DQLWEDSLFA QPIQALLHAA YANASLYALR
SAAHKLLNYA QQARGYLDFR AHGLNVACEQ LRQNIHQVEE SLQLLQLNQA QVSGEVKHEI
ELALTSANHF LRQQQDALNA QLAALFQDDS EPLSEIRTCC ETLLQTAQNT ISRDFTLRFA
ELESTLCRVL TDVIRPIEQQ VKMELSESGF RPGFHFPVFH GVVPHFNTRQ LFSEVISRQD
ATDEQSTRLG VVRETFSRWL NQPDWGRGNE KSPTETVDYS VLQRALSAEV DLYCQQMAKV
LAEQVDESVT AGMNTFFAEF ASCLTELQTR LRESLALRQQ NESVVRLMQQ QLQQTVMTHG
WIYTDAQLLR DDIQTLFTAE RY