Gene EcHS_A2653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2653 
Symbol 
ID5591508 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2664820 
End bp2666160 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content40% 
IMG OID640921768 
Producthypothetical protein 
Protein accessionYP_001459295 
Protein GI157161977 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.00498534 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATATA TCGAAAGTGA AAAGGATGTA AGTGATCGCT TAGGTATGTA CTTTATCTCT 
GTCTGGGAAG ATTGCGATAC CGTTGTTGCC GCTGGCATAC GGGAATTTCG AGAGCGATGG
GCAAACTACA CCGTACATTC AAAAAGAAAG GACCCAAGAA GCCATTTGAA GGGTATTTCA
TCCTACGACA AACGAGAAGC GGATAAGCGT CCTATTGGAG AACTTATCTT AGAGGTGATT
GATCCTAAAG TTTCAGCATT CCTTAACTCT CTTCCTGAGA GTCATTTTCA ATTCTTTCCT
ATCCCCTATA CGAATATTGC TCACTCTCCC TTACCACGTT CAAATGATAA ATTTACCAGT
ACTGAAAGCG TTCCTAGTTT TTCGCCTATT AAATATGGTT CTAAATTTAA AGAAGATACT
CCGATAACCA ATCTAGTGAT GGTTATGGGT ATCAAAAATG CTCATGATGT TATTAAACAG
CAACTAAGGC GTTTTGAAAA AACGGAACAT CAATATCGAC CTATTGATCT TAAGTTTCAA
GCAACCGTGG ATAATCTGCT TGAGCTATTA TGGCAGCTTC ACCTTACACC GACGCGCTTC
AAGCAACACA GTGCAGACAC AAAACTCAAT GCGCGACGAA AGCAAACTTT CTGCGAGCTG
TGTGGCCAAA GAAATGAACT TGCAGAATAT TTCTATAAGC TAGATAACAA TATGCTAGAA
CTGGAAGATG AGATAGAAAG TCACAACGAA CAGAATCCTG ATAATCAGAA AAAACTACAA
CTCAGCCACA GGTATTGTTC TTACCATAAA CCGAAACACA AAAATGGCTG TACGTGGAAC
TCCGCTTACA AGAGTGCTCT GCACTCAAAA GACCAATTCG AGAATGAATT GCAGAGATTG
CAACTTCACA TTGTCAAAGT CGAAGAGCTT AAAGTCATTT CTAGAGATGA ACTAGTTGAC
CTTTATTTCT ATCATTTCCT CCAAGATAAA TGCGTCACTC AGAAACAAAG TGACGCATTT
TTCCATTACG TTAGGGATAA TTTTAATTAC CCAATCGTAA TTAAGGAAGA AACAGAAAGA
CTCATTCATG AAGCAGCCGT GCGCTTGACT GGAGCTGGTA CGACTCTTGG AGCTGATGAT
GTCGGAAAAC TGCGAGATAT CGCTCGCCAC ATGGTTGACT CACGATTAAC AGATAGTAAG
AAACGAATGC TCGTTCTTAA GAAACAAGGA TTCAATCAGA GACACATTGC AGATAAGTTA
ACGGAAATTG AGGGAAGAAC TATTTCACCC CAGGCGGTTT CTAAAGCTTT GAAAAGTGTA
GATAGTAACT TTAATATTTA A
 
Protein sequence
MAYIESEKDV SDRLGMYFIS VWEDCDTVVA AGIREFRERW ANYTVHSKRK DPRSHLKGIS 
SYDKREADKR PIGELILEVI DPKVSAFLNS LPESHFQFFP IPYTNIAHSP LPRSNDKFTS
TESVPSFSPI KYGSKFKEDT PITNLVMVMG IKNAHDVIKQ QLRRFEKTEH QYRPIDLKFQ
ATVDNLLELL WQLHLTPTRF KQHSADTKLN ARRKQTFCEL CGQRNELAEY FYKLDNNMLE
LEDEIESHNE QNPDNQKKLQ LSHRYCSYHK PKHKNGCTWN SAYKSALHSK DQFENELQRL
QLHIVKVEEL KVISRDELVD LYFYHFLQDK CVTQKQSDAF FHYVRDNFNY PIVIKEETER
LIHEAAVRLT GAGTTLGADD VGKLRDIARH MVDSRLTDSK KRMLVLKKQG FNQRHIADKL
TEIEGRTISP QAVSKALKSV DSNFNI