Gene EcHS_A4268 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4268 
Symbol 
ID5593188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4269433 
End bp4271529 
Gene Length2097 bp 
Protein Length698 aa 
Translation table11 
GC content54% 
IMG OID640923370 
Productputative lipoprotein 
Protein accessionYP_001460815 
Protein GI157163497 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones54 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA GACATCTGCT TAGCTTACTG GCGCTGGGCA TTAGCACAGC TTGCTACGGC 
GAAACATATC CTGCGCCCAT TGGTCCGTCG CAGTCGGATT TCGGTGGCGT AGGATTATTA
CAAACGCCCA CCGCGCGTAT GGCACGGGAA GGGGAGTTGA GTCTGAACTA TCGCGATAAC
GATCAGTACC GTTATTACTC AGCTTCAGTG CAACTCTTCC CGTGGCTGGA AACGACGCTG
CGTTACACCG ACGTGCGCAC CCGGCAGTAC AGCAGCGTCG AAGCGTTCTC TGGTGATCAA
ACGTATAAAG ATAAAGCCTT CGATCTCAAA CTGCGCTTGT GGGAAGAGAG TTACTGGCTA
CCGCAAGTGG CGGTTGGTGC GCGGGATATC GGCGGTACGG GGCTGTTTGA TGCGGAATAT
CTTGTTGCCA GCAAAGCCTG GGGACCGTTC GATTTTACGC TCGGCCTGGG CTGGGGATAT
CTGGGTACCA GCGGTAATGT GAAAAATCCG CTCTGTTCAG CCAGTGATAA ATATTGCTAT
CGCGATAACA GCTACAAGCA GGCAGGTTCG ATCGACGGCA GTCAGATGTT CCACGGCCCG
GCATCACTGT TTGGCGGCGT GGAATACCAG ACGCCCTGGC AACCGCTGCG TCTGAAACTG
GAGTATGAAG GCAATAATTA TCAGCAGGAT TTTGCCGGTA AGCTTGAGCA AAAAAGTAAG
TTTAACGTCG GGGCGATTTA TCGCGTTACC GATTGGGCCG ACGTTAACCT TAGCTATGAA
CGTGGCAACT CCTTTATGTT TGGCGTCACG CTGCGCACCA ACTTTAACGA TCTGCGCCCG
TCTTACAACG ATAACGCCCG CCCGCAATAT CAACCGCAGC CGCAGGATGC CATTTTGCAG
CATTCGGTGG TGGCGAATCA GTTAACGCTG TTGAAATACA ATGCCGGACT TGCCGATCCG
CAGATCCAGG CGAAAGGCGA TACGCTGTAT GTCACCGGCG AGCAGGTGAA ATATCGTGAT
TCGCGCGAAG GGATCATCCG TGCTAATCGG ATCGTGATGA ACGATCTGCC GGATGGGATC
AAAACGATCC GCATTACGGA AAATCGTCTT AACATGCCGC AGGTGACGAC GGAAACCGAT
GTCGCCAGCC TGAAAAATCA TCTCGCCGGA GAGCCGTTGG GCCACGAAAC GAAGCTGGCG
CAAAAACGCG TCGAGCCAGT GGTTCCGCAG TCCACCGAGC AGGGCTGGTA TATCGACAAA
TCACGCTTTG ATTTCCACAT CGATCCGGTG CTGAACCAGT CGGTCGGTGG CCCGGAAAAC
TTTTACATGT ATCAGCTGGG CGTGATGGGA ACGGCAGATT TGTGGCTGAC GGACCATCTG
CTGACCACCG GCAGCCTGTT TGCAAATCTT GCCAACAACT ACGACAAGTT TAACTACACC
AATCCGCCGC AGGACTCGCA CTTACCGCGC GTTCGTACCC ATGTGCGCGA GTATGTGCAG
AACGATGTCT ATGTGAATAA CCTGCAAGCC AACTACTTCC AGCATCTGGG CAACGGCTTC
TACGGTCAGG TCTATGGAGG TTATCTCGAA ACCATGTTTG GCGGTGCCGG GGCAGAAGTG
TTGTATCGTC CGCTGGACAG CAACTGGGCG TTTGGTCTGG ATGCCAACTA CGTTAAACAG
CGCGACTGGC GTAGTGCAAA AGATATGATG AAATTCACCG ACTACAGCGT GAAAACCGGA
CATCTGACCG CCTACTGGAC GCCATCTTTC GCTCAGGATG TGTTAGTTAA AGCCAGCGTC
GGGCAGTATC TGGCAGGGGA TAAAGGCGGC ACGCTGGAGA TCGCCAAACG CTTTGATAGC
GGCGTGGTGG TGGGTGGCTA TGCCACGATC ACTAATGTTT CAAAAGAGGA GTACGGCGAA
GGGGACTTCA CCAAAGGCGT GTATGTCTCG GTACCGTTGG ATCTCTTCTC GTCTGGCCCG
ACACGCAGCC GTGCGGCGAT TGGCTGGACG CCGCTGACGC GTGACGGTGG TCAGCAACTT
GGGCGTAAGT TCCAGTTGTA TGATATGACC AGCGACCGTA GCGTAAATTT CCGCTAA
 
Protein sequence
MKKRHLLSLL ALGISTACYG ETYPAPIGPS QSDFGGVGLL QTPTARMARE GELSLNYRDN 
DQYRYYSASV QLFPWLETTL RYTDVRTRQY SSVEAFSGDQ TYKDKAFDLK LRLWEESYWL
PQVAVGARDI GGTGLFDAEY LVASKAWGPF DFTLGLGWGY LGTSGNVKNP LCSASDKYCY
RDNSYKQAGS IDGSQMFHGP ASLFGGVEYQ TPWQPLRLKL EYEGNNYQQD FAGKLEQKSK
FNVGAIYRVT DWADVNLSYE RGNSFMFGVT LRTNFNDLRP SYNDNARPQY QPQPQDAILQ
HSVVANQLTL LKYNAGLADP QIQAKGDTLY VTGEQVKYRD SREGIIRANR IVMNDLPDGI
KTIRITENRL NMPQVTTETD VASLKNHLAG EPLGHETKLA QKRVEPVVPQ STEQGWYIDK
SRFDFHIDPV LNQSVGGPEN FYMYQLGVMG TADLWLTDHL LTTGSLFANL ANNYDKFNYT
NPPQDSHLPR VRTHVREYVQ NDVYVNNLQA NYFQHLGNGF YGQVYGGYLE TMFGGAGAEV
LYRPLDSNWA FGLDANYVKQ RDWRSAKDMM KFTDYSVKTG HLTAYWTPSF AQDVLVKASV
GQYLAGDKGG TLEIAKRFDS GVVVGGYATI TNVSKEEYGE GDFTKGVYVS VPLDLFSSGP
TRSRAAIGWT PLTRDGGQQL GRKFQLYDMT SDRSVNFR