Gene Information |
Locus tag | EcHS_A0310 |
Symbol | |
ID | 5593468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 313576 |
End bp | 315759 |
Gene Length | 2184 bp |
Protein Length | 727 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640919496 |
Product | hypothetical protein |
Protein accession | YP_001457082 |
Protein GI | 157159764 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01216] ATP synthase, F1 epsilon subunit (delta in mitochondria) |
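The coordinate and length fields above are consistent with one another, and the check can be sketched as follows. This is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and are not part of the record.

```python
# Values copied from the record above.
START_BP = 313576
END_BP = 315759


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Amino acid count, assuming the CDS ends in one uncounted stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "2184 727"
```

Both values agree with the Gene Length (2184 bp) and Protein Length (727 aa) fields.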
|
|
Plasmid Coverage information |
Num covering plasmid clones | 62 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGACT GGCATCGCAA GGTGCTGTGC AACTTTGATA ATGCCTGGTC AGCAACGCAG GATATGCGTG AGCAGATTAT TGAGGCTCAA CGTTTCGTCA GGGTGTCCGG CGCACAGTGG GAAGGCAGCA CAAACGCTGG TTACTCATTT GATGAAGGCA GGTTTGAGCA TTATCCGCGT TTTGAACTGA ATAAGATTGC CCGTGAATGT GATCGCATCA TTGGCGAGTA TCGACAGAAT CGCATCAGCG TTAAATTCAG GCCGAAGGAC GATAAGGCAT CGGAAGCGTT AGCCGAAAAG ATGAACGGCA AATTCCGCGC TGACTATCAG GAAACATCCG GTGGCGAAGC GTGTGATAAC GCATTTGATG ATGCCGTAAC GGGTGGATTC GGTTGTTTCC GCATGTGTGC CGATTACGAA GATGAAATGG ATCCGAGTAA CGAGCAGCGA CGCATCAGCC TTCTTCCTGT TTACGACCCA GCGACATGCG TCTTCTTCGA TCAGGACAGC AAGCAATATG ACCGTTCTGA TGCTATGTGG GCTATGGAAA TGTTCTCCAT GACGCCTAAA GCGTTCGAGG CTGAATACCC TGATTCCATC GCGGCAAGCC TTTCTCGTGA TGACACTGGT ACTCAGTATG ACTGGTCAAC GCCCGATGCC ATCTATGTTG GACGCTACTA CGAAGTTCGC ATAGAGAAGG TGAAGCTCAC GGCGTGGCGC AACCCTGTTA GCGGAGAAAC GGCAATTTAT GATGAAGAGC AAATCAAAGA TATTGTCGAC GAGCTGACCG ATGGTGCATT CGAACTGATT GGCGAGCGAA CGGTGAAGAA GCGCCGCGTT TATTGCGGTC TTCTGTCTGG CGCTGAATGG CTGGAAGAAC CTAAGCGTAT TCCGGGCGAA CATATTCCTC TCATCCCGGT ATATGGGCGT CGCTCATTTG TTGATAATCA GGAGCGAATC GAAGGCCACG CAGCAAAAGC GATGGATGCA CAGCGTCTTG AGAACCTGAT GGTTTCCATG ATTGCAGATA ACGCTACTCA GGCTGGCGGT GATGGCATTC CTGTAGTTGA TGTTGACATG ATTCCTGGTC CTCTCGCCAC TCATTGGGCG GAGCGCAACA AAAAGCGCCC GGCGTTCCTG CCGATGGTCA GTCTGAAAAA CAAAAACGGA GATATTACTG CGCAGGCTCA GGTCAGCAGT TATACGCCTC CGACACAAAT GCCTCCAGCT CTTGCCGGGC TATTGCAGTA CACCGGAACG GCTATTCAGC AAATTACAGG TGCGTCGCAG ATTGAGAACA TGCCGAGCAA CGTCGCCACC GATACCGTTG ATAGCATCTT TAACCGGATG GACACGCAGT CCTATATCTA CATGGACAAC ATGGCTAAAT CCATGCGTCG CGCTGGCGTT GTGTGGCTTT CTATGGCGCG TGAGGTCTAT GGCAGTGATA CGCCGATGCG TATCGTTAAT GAGGACGGCA GCGATGACGT GGCGTTGATG ACTGGTGAAG TGGTTGACCG TCAGACAGGG CAGGTTATCG CGCTTAACGA CCTTTCGCAG GGTAACTATG AAGTGACTGT CGATGTCGGT CAGTCGTTCG CTACTCGCCG TGATGCAACG GTTAAGTCGT TACTTTCCAT GCTGGCACTT ATCCCACCAG GAACGCCGAA GCACGATCTT GTATCGTCGA TGATTCTCGA CAATATGGAC GGCGAAGGGA TGGACGACCT TAAAGAATAC AACCGCAATC AGTTGCTTCT GTCTGGCGTT ATCAAGCCGA GAACGCCTGA AGAACAGCAG 
ATGGTTGAAC AGGCGAAACA ACAACAGGCC AGTCAGCCAG ATCCGGCTAT GGTTGCTGCG CAAGGTCAGC TTCTTGCTGG TCAGGCTGAA TTGCAGAAAG CGCAGAACGA GCAGGCAGCC ATTCAGGTTA AAGCATTCCA GGCACAGACT GATGCTCAGG TTGCAGCGGC AAACGTTGTG AAAATCCTCG CATCTGCCGA TAGCCAGCAG AAATCTGATA TCCGCGAGGC TCTGAAACTG CTCGGACAGT TCCAGCAACA GCAAGGAGAT AATGCCCGTG CTGATGCAGA GCTTGTCCTG AAAAGTCAGG CACAGGGCCA TGCGCAGCGC ATGGACATCA GCAGCATCCT GCAAAAATCA ACTCAGCAAC AACCACAGCA GTAA
|
Protein sequence | MTDWHRKVLC NFDNAWSATQ DMREQIIEAQ RFVRVSGAQW EGSTNAGYSF DEGRFEHYPR FELNKIAREC DRIIGEYRQN RISVKFRPKD DKASEALAEK MNGKFRADYQ ETSGGEACDN AFDDAVTGGF GCFRMCADYE DEMDPSNEQR RISLLPVYDP ATCVFFDQDS KQYDRSDAMW AMEMFSMTPK AFEAEYPDSI AASLSRDDTG TQYDWSTPDA IYVGRYYEVR IEKVKLTAWR NPVSGETAIY DEEQIKDIVD ELTDGAFELI GERTVKKRRV YCGLLSGAEW LEEPKRIPGE HIPLIPVYGR RSFVDNQERI EGHAAKAMDA QRLENLMVSM IADNATQAGG DGIPVVDVDM IPGPLATHWA ERNKKRPAFL PMVSLKNKNG DITAQAQVSS YTPPTQMPPA LAGLLQYTGT AIQQITGASQ IENMPSNVAT DTVDSIFNRM DTQSYIYMDN MAKSMRRAGV VWLSMAREVY GSDTPMRIVN EDGSDDVALM TGEVVDRQTG QVIALNDLSQ GNYEVTVDVG QSFATRRDAT VKSLLSMLAL IPPGTPKHDL VSSMILDNMD GEGMDDLKEY NRNQLLLSGV IKPRTPEEQQ MVEQAKQQQA SQPDPAMVAA QGQLLAGQAE LQKAQNEQAA IQVKAFQAQT DAQVAAANVV KILASADSQQ KSDIREALKL LGQFQQQQGD NARADAELVL KSQAQGHAQR MDISSILQKS TQQQPQQ
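The relationship between the gene sequence, the translation table and the protein sequence can be sketched as below. This is an illustration only, not the pipeline that produced the record. It assumes the sequence is given in blocks separated by whitespace, as shown above, and contains only A, C, G and T. The codon string is the standard amino acid assignment, which translation table 11 shares for internal codons; the alternative start codons that table 11 allows are ignored here because this gene starts with ATG. The helper names `clean`, `gc_percent` and `translate` are hypothetical.

```python
from itertools import product

# Codon order T, C, A, G at each position, matching the NCBI table layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence shown above.
print(translate("ATGACTGACT GGCATCGCAA GGTGCTGTGC"))  # MTDWHRKVLC
print(translate("ACTCAGCAAC AACCACAGCA GTAA"))        # TQQQPQQ
```

The two printed fragments match the first ten and last seven residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.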