Gene EcHS_A0235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0235 
Symbol 
ID5592002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp254623 
End bp256473 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content53% 
IMG OID640919422 
Producthypothetical protein 
Protein accessionYP_001457009 
Protein GI157159691 
COG category[S] Function unknown 
COG ID[COG3519] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03359] type VI secretion protein, VC_A0110 family 


Plasmid Coverage information

Num covering plasmid clones62 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATTTG AAGAACGCTA TTTCCGGGAA GAACTCGATT ACCTGCGCCA GCTTAGCAAG 
CTGCTGGCAA CGGAAAAACC CCATCTGGCC CGCTTCCTGG CCGAAAAAGA TGCGGATCCG
GATATTGAAC GCCTGCTGGA AGGGGTGGCT TTTCTTACCG GCAATCTCCG CCAGAAAATT
GAGGATGAAT TCCCAGAACT GACGCACGGG CTTATTAAGA TGCTATGGCC TAATTACCTG
CGTCCGGTTC CGGCAATGAC CCTTATTGAA TATACGCCGG ATATGGATAA GTCTTCTGTA
CCGGTGTTAA TCCCCCGTAA TGAGCAGTTT ACAACCAACG CCGGGGAAAT CAGAGTTGAT
GAAGTGCTGC CCTCTGATGC TAAAAAGGAG GAGCCGCCTC CCTGTACCTT CACTCTCTGC
CGGGATATCT GGCTGCTGCC CGTTCGCCTG GAGCAGATTG AAAACCGCAG TACGACCCGT
AATGGTGTTA TCAACATCAC CTTTTCGGTC GCACCGGGAA CGGACTTCCG CACGCTGGAT
CTGAACAAAC TTCGCTTCTG GCTCGGCAAT GACGACAACT ATACCCGTGA CCAGCTTTAT
TTATGGTTCT GCGAATACTT GCAGGGTGCC GACCTGACTG TGGGTGAACA GCATATTCGC
CTGCCTGAGT TTATGCTAAA AGCTGTCGGT TTTGAGCCGC AGGATGCCAT GCTGCCCTGG
CCGAAAAACG TCCACAGCGG CTACCGGATC CTTCAGGAGT ATTTCTGTTA CCCCGATGCG
TTTCTCTTTT TTGATCTTTG TGGTTGTCCG GCTTTGCCTG ACGGATTGCA GGCGGAATTC
TTTACCCTGC AACTGCGTTT TTCGCGCCCT TTGCCCGTGG ACATCCGGCT GCGCCGCGAT
TCCCTGCGCC TGTATTGCGC ACCTGCCATT AATTTATTTA TCCACCATGC AGAAGCCATC
ACGCTGGACA ACCGGCGGGC AGACTATCCG CTGGTTCCCA GCCGCCATTA CCCACAACAT
TACGATGTAT TTTCCGTTAA CAGTGTGGTG AGCCAGGTCC AGGATATGTT CAGGAAAAAA
GATCTGGGGC GTCCTGTTTC GACGCAGGCC GCGCGCCAGT GGCCAGCCTT TGAAAGTTTC
AGCCATCAGA TGGAATACAG CCGGAAGCGG GAAGTGGTGT ACTGGCATCA CCGGACCAAA
ACATCCCTGT TCCATCGCGG CTTTGATCAT ACCCTTGCCT TTATACATGC TGATGGCAGT
TATCCGTCAG ACGAATCTCT GCTCAGTAAT GAAGTGGTTT CGGTATCGCT GACCTGTACC
AACCGTGAGC TTCCGTCACA AATTCGTTCC GGCGATATCA CCGGCACAAC CGGTAAAAAT
GCAGCTGTTG CTTCATTTCG CAACATTACC CGCCCGACGC AACCACTCTG GCCGGTCATT
GATGGCAGCC TGCACTGGTC CCTACTCTCC GCCATGAACC TGAATTATCT GTCATTACTG
GATACGGACG CGCTGAAGCA GGTCATCGCC AACTTTGATC GCCACGCAAT CCATCATCCG
CAGACGGCAC GGCTGTCACA ACAAAAGCTG GATGCCATTG AGCGTCTGGA GACCCGCCCC
GTTGATCGCC TGTTTACGGG TATTCCCGTC CGGGGACTGG CCTCCACGCT GTATCTGCAC
CCGGAGCCGT TTGTCTGTGA AGGGGAAATG TATCTGCTCG GTACGGTGCT TTCGCATTTT
CTGTCGCTGT ACGCCAGCGT TAACTCATTC CACATGCTGA CCGTTGTGAA CACAGAAAGC
CAGGAGACAT GGAAATGGAC GGAAAGAATC GGGCAGCATC CTCTTATCTG A
 
Protein sequence
MEFEERYFRE ELDYLRQLSK LLATEKPHLA RFLAEKDADP DIERLLEGVA FLTGNLRQKI 
EDEFPELTHG LIKMLWPNYL RPVPAMTLIE YTPDMDKSSV PVLIPRNEQF TTNAGEIRVD
EVLPSDAKKE EPPPCTFTLC RDIWLLPVRL EQIENRSTTR NGVINITFSV APGTDFRTLD
LNKLRFWLGN DDNYTRDQLY LWFCEYLQGA DLTVGEQHIR LPEFMLKAVG FEPQDAMLPW
PKNVHSGYRI LQEYFCYPDA FLFFDLCGCP ALPDGLQAEF FTLQLRFSRP LPVDIRLRRD
SLRLYCAPAI NLFIHHAEAI TLDNRRADYP LVPSRHYPQH YDVFSVNSVV SQVQDMFRKK
DLGRPVSTQA ARQWPAFESF SHQMEYSRKR EVVYWHHRTK TSLFHRGFDH TLAFIHADGS
YPSDESLLSN EVVSVSLTCT NRELPSQIRS GDITGTTGKN AAVASFRNIT RPTQPLWPVI
DGSLHWSLLS AMNLNYLSLL DTDALKQVIA NFDRHAIHHP QTARLSQQKL DAIERLETRP
VDRLFTGIPV RGLASTLYLH PEPFVCEGEM YLLGTVLSHF LSLYASVNSF HMLTVVNTES
QETWKWTERI GQHPLI