Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0235 |
Symbol | |
ID | 5592002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 254623 |
End bp | 256473 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640919422 |
Product | hypothetical protein |
Protein accession | YP_001457009 |
Protein GI | 157159691 |
COG category | [S] Function unknown |
COG ID | [COG3519] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03359] type VI secretion protein, VC_A0110 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 62 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATTTG AAGAACGCTA TTTCCGGGAA GAACTCGATT ACCTGCGCCA GCTTAGCAAG CTGCTGGCAA CGGAAAAACC CCATCTGGCC CGCTTCCTGG CCGAAAAAGA TGCGGATCCG GATATTGAAC GCCTGCTGGA AGGGGTGGCT TTTCTTACCG GCAATCTCCG CCAGAAAATT GAGGATGAAT TCCCAGAACT GACGCACGGG CTTATTAAGA TGCTATGGCC TAATTACCTG CGTCCGGTTC CGGCAATGAC CCTTATTGAA TATACGCCGG ATATGGATAA GTCTTCTGTA CCGGTGTTAA TCCCCCGTAA TGAGCAGTTT ACAACCAACG CCGGGGAAAT CAGAGTTGAT GAAGTGCTGC CCTCTGATGC TAAAAAGGAG GAGCCGCCTC CCTGTACCTT CACTCTCTGC CGGGATATCT GGCTGCTGCC CGTTCGCCTG GAGCAGATTG AAAACCGCAG TACGACCCGT AATGGTGTTA TCAACATCAC CTTTTCGGTC GCACCGGGAA CGGACTTCCG CACGCTGGAT CTGAACAAAC TTCGCTTCTG GCTCGGCAAT GACGACAACT ATACCCGTGA CCAGCTTTAT TTATGGTTCT GCGAATACTT GCAGGGTGCC GACCTGACTG TGGGTGAACA GCATATTCGC CTGCCTGAGT TTATGCTAAA AGCTGTCGGT TTTGAGCCGC AGGATGCCAT GCTGCCCTGG CCGAAAAACG TCCACAGCGG CTACCGGATC CTTCAGGAGT ATTTCTGTTA CCCCGATGCG TTTCTCTTTT TTGATCTTTG TGGTTGTCCG GCTTTGCCTG ACGGATTGCA GGCGGAATTC TTTACCCTGC AACTGCGTTT TTCGCGCCCT TTGCCCGTGG ACATCCGGCT GCGCCGCGAT TCCCTGCGCC TGTATTGCGC ACCTGCCATT AATTTATTTA TCCACCATGC AGAAGCCATC ACGCTGGACA ACCGGCGGGC AGACTATCCG CTGGTTCCCA GCCGCCATTA CCCACAACAT TACGATGTAT TTTCCGTTAA CAGTGTGGTG AGCCAGGTCC AGGATATGTT CAGGAAAAAA GATCTGGGGC GTCCTGTTTC GACGCAGGCC GCGCGCCAGT GGCCAGCCTT TGAAAGTTTC AGCCATCAGA TGGAATACAG CCGGAAGCGG GAAGTGGTGT ACTGGCATCA CCGGACCAAA ACATCCCTGT TCCATCGCGG CTTTGATCAT ACCCTTGCCT TTATACATGC TGATGGCAGT TATCCGTCAG ACGAATCTCT GCTCAGTAAT GAAGTGGTTT CGGTATCGCT GACCTGTACC AACCGTGAGC TTCCGTCACA AATTCGTTCC GGCGATATCA CCGGCACAAC CGGTAAAAAT GCAGCTGTTG CTTCATTTCG CAACATTACC CGCCCGACGC AACCACTCTG GCCGGTCATT GATGGCAGCC TGCACTGGTC CCTACTCTCC GCCATGAACC TGAATTATCT GTCATTACTG GATACGGACG CGCTGAAGCA GGTCATCGCC AACTTTGATC GCCACGCAAT CCATCATCCG CAGACGGCAC GGCTGTCACA ACAAAAGCTG GATGCCATTG AGCGTCTGGA GACCCGCCCC GTTGATCGCC TGTTTACGGG TATTCCCGTC CGGGGACTGG CCTCCACGCT GTATCTGCAC CCGGAGCCGT TTGTCTGTGA AGGGGAAATG TATCTGCTCG GTACGGTGCT TTCGCATTTT CTGTCGCTGT ACGCCAGCGT TAACTCATTC CACATGCTGA CCGTTGTGAA CACAGAAAGC CAGGAGACAT GGAAATGGAC GGAAAGAATC GGGCAGCATC CTCTTATCTG A
|
Protein sequence | MEFEERYFRE ELDYLRQLSK LLATEKPHLA RFLAEKDADP DIERLLEGVA FLTGNLRQKI EDEFPELTHG LIKMLWPNYL RPVPAMTLIE YTPDMDKSSV PVLIPRNEQF TTNAGEIRVD EVLPSDAKKE EPPPCTFTLC RDIWLLPVRL EQIENRSTTR NGVINITFSV APGTDFRTLD LNKLRFWLGN DDNYTRDQLY LWFCEYLQGA DLTVGEQHIR LPEFMLKAVG FEPQDAMLPW PKNVHSGYRI LQEYFCYPDA FLFFDLCGCP ALPDGLQAEF FTLQLRFSRP LPVDIRLRRD SLRLYCAPAI NLFIHHAEAI TLDNRRADYP LVPSRHYPQH YDVFSVNSVV SQVQDMFRKK DLGRPVSTQA ARQWPAFESF SHQMEYSRKR EVVYWHHRTK TSLFHRGFDH TLAFIHADGS YPSDESLLSN EVVSVSLTCT NRELPSQIRS GDITGTTGKN AAVASFRNIT RPTQPLWPVI DGSLHWSLLS AMNLNYLSLL DTDALKQVIA NFDRHAIHHP QTARLSQQKL DAIERLETRP VDRLFTGIPV RGLASTLYLH PEPFVCEGEM YLLGTVLSHF LSLYASVNSF HMLTVVNTES QETWKWTERI GQHPLI
|
| |