Gene Information |
Locus tag | EcHS_A1069 |
Symbol | |
ID | 5591785 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1083344 |
End bp | 1085506 |
Gene Length | 2163 bp |
Protein Length | 720 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640920234 |
Product | hypothetical protein |
Protein accession | YP_001457799 |
Protein GI | 157160481 |
COG category | [S] Function unknown |
COG ID | [COG1289] Predicted membrane protein |
TIGRFAM ID | [TIGR01666] hypothetical membrane protein, TIGR01666; [TIGR01667] integral membrane protein, YccS/YhfK family |
|
|
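The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention. The function names are hypothetical and only serve the illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


# Values copied from the record above.
length_bp = gene_length(1083344, 1085506)
print(length_bp)                  # 2163
print(protein_length(length_bp))  # 720
```

Under these assumptions the computed values agree with the Gene Length (2163 bp) and Protein Length (720 aa) fields.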
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.000000419542 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCTTTA TGCTAAGTCC TTTGCTCAAA CGCTATACCT GGAACAGCGC CTGGCTGTAT TACGCGCGTA TTTTTATTGC GCTTTGTGGA ACCACAGCGT TTCCGTGGTG GCTGGGTGAT GTAAAACTGA CGATTCCGCT AACGCTGGGG ATGGTGGCAG CGGCGCTGAC CGATCTCGAT GACCGACTGG CGGGACGTTT GCGTAACCTC ATCATTACGC TGTTCTGCTT TTTTATCGCC TCGGCCTCAG TAGAATTGCT GTTTCCCTGG CCCTGGCTAT TTGCGATTGG CTTAACGCTC TCTACCAGCG GCTTCATTTT GCTCGGCGGT CTGGGTCAAC GCTATGCAAC AATTGCCTTC GGTGCATTGC TGATCGCCAT TTACACTATG TTGGGAACAT CACTGTATGA GCACTGGTAT CAGCAGCCGA TGTATCTGCT GGCCGGTGCC GTCTGGTACA ACGTCCTGAC ACTTATTGGT CATCTGCTGT TCCCGGTCCG CCCGCTGCAG GACAACCTGG CGCGTTGCTA TGAACAACTG GCGCGTTATC TTGAGCTCAA GTCGCGCATG TTTGATCCTG ATATTGAAGA TCAAAGCCAG GCACCGCTGT ACGATTTGGC TCTCGCCAAC GGTCTGCTGA TGGCGACATT GAATCAGACG AAACTCTCGC TGCTGACCCG CTTACGTGGC GATCGTGGTC AACGGGGAAC GCGTCGCACG CTGCATTATT ACTTTGTCGC ACAGGATATT CACGAGCGTG CCAGCTCTTC TCATATTCAG TATCAAACAT TGCGTGAACA TTTTCGCCAC AGCGACGTGC TGTTCCGTTT TCAGCGGCTG ATGTCGATGC AGGGCCAGGC GTGCCAGCAA CTGTCACGCT GTATTTTGTT GCGTCAGCCT TATCAACATG ATCCGCATTT TGAGCGCGCT TTTACGCATA TTGATGCTGC GCTGGAGCGG ATGCGCGATA ACGGCGCACC CGCCGATTTA CTCAAAACAC TGGGATTTTT GCTGAACAAT TTACGCGCCA TTGATGCCCA ACTGGCAACA ATTGAATCAG AACAGGCCCA GGCACTACCC CATAATAATG ACGAAAATGA GCTCGCTGAT GACAGCCCGC ACGGGTTGAG TGATATCTGG CTGCGTCTTA GCCGTCACTT CACGCCGGAA TCCGCCCTCT TCCGTCATGC GGTAAGAATG TCGCTGGTGT TGTGCTTCGG CTACGCCATC ATTCAGATAA CCGGAATGCA TCACGGGTAT TGGATCTTGC TGACAAGTTT GTTTGTCTGC CAGCCAAACT ATAACGCCAC GCGCCACCGC CTGAAGTTAA GGATTATTGG TACGCTGGTA GGTATCGCCA TTGGCATTCC TGTGCTGTGG TTTGTGCCAT CACTGGAAGG GCAGCTGGTG CTGCTGGTTA TTACCGGCGT GCTCTTTTTT GCCTTCCGTA ACGTGCAATA CGCTCATGCA ACGATGTTCA TCACACTTTT GGTGCTACTG TGTTTTAACT TACTGGGTGA AGGTTTTGAA GTAGCGTTAC CTCGCGTAAT CGATACGCTG ATTGGTTGTG CCATTGCGTG GGCGGCAGTG AGCTACATCT GGCCTGACTG GCAGTTTCGC AATCTGCCGC GCATGCTCGA ACGCGCCACA GAGGCCAACT GTCGGTATCT CGATGCCATA CTGGAGCAAT ACCATCAGGG GCGTGATAAC CGTCTGGCGT ATCGTATTGC CCGCCGCGAT GCACACAACC GTGATGCTGA GCTGGCGTCG GTGGTATCAA ATATGTCCAG CGAGCCGAAC 
GTTACCCCGC AAATTCGCGA AGCCGCGTTT CGGTTGCTGT GCCTTAACCA TACGTTTACC AGCTATATCT CAGCCCTCGG TGCTCACCGG GAGCAGTTAA CTAATCCTGA AATTCTGGCG TTTCTTGATG ACGCAGTTTG CTATGTTGAT GACGCGTTAC ATCATCAACC TGCTGATGAA GAACGCGTCA ATGAGGCATT AGCTAGCCTG AAACAGCGGA TGCAGCAACT TGAACCACGG GCAGACAGCA AAGAACCTCT GGTCGTACAA CAAGTTGGAT TATTGATTGC ATTACTGCCT GAGATTGGTC GTCTGCAACG CCAGATTACT CAAGTTCCGC AGGAAACTCC TGTTTCGGCG TAA
|
Protein sequence | MAFMLSPLLK RYTWNSAWLY YARIFIALCG TTAFPWWLGD VKLTIPLTLG MVAAALTDLD DRLAGRLRNL IITLFCFFIA SASVELLFPW PWLFAIGLTL STSGFILLGG LGQRYATIAF GALLIAIYTM LGTSLYEHWY QQPMYLLAGA VWYNVLTLIG HLLFPVRPLQ DNLARCYEQL ARYLELKSRM FDPDIEDQSQ APLYDLALAN GLLMATLNQT KLSLLTRLRG DRGQRGTRRT LHYYFVAQDI HERASSSHIQ YQTLREHFRH SDVLFRFQRL MSMQGQACQQ LSRCILLRQP YQHDPHFERA FTHIDAALER MRDNGAPADL LKTLGFLLNN LRAIDAQLAT IESEQAQALP HNNDENELAD DSPHGLSDIW LRLSRHFTPE SALFRHAVRM SLVLCFGYAI IQITGMHHGY WILLTSLFVC QPNYNATRHR LKLRIIGTLV GIAIGIPVLW FVPSLEGQLV LLVITGVLFF AFRNVQYAHA TMFITLLVLL CFNLLGEGFE VALPRVIDTL IGCAIAWAAV SYIWPDWQFR NLPRMLERAT EANCRYLDAI LEQYHQGRDN RLAYRIARRD AHNRDAELAS VVSNMSSEPN VTPQIREAAF RLLCLNHTFT SYISALGAHR EQLTNPEILA FLDDAVCYVD DALHHQPADE ERVNEALASL KQRMQQLEPR ADSKEPLVVQ QVGLLIALLP EIGRLQRQIT QVPQETPVSA
|
| |
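The gene and protein sequences above can be cross-checked against each other. The sketch below is a minimal, dependency-free illustration: it assumes the gene sequence is listed in coding orientation (it begins with ATG even though the Strand field is "-"), strips the spacing used for display, and translates with the codon assignments of translation table 11. The helper names are hypothetical and are not part of the source database.

```python
# Codon assignments for NCBI translation table 11 (bacterial), in TCAG order.
# The amino acid assignments are the same as in the standard table.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA sequence up to the first stop codon.

    Simplification: an alternative start codon (e.g. GTG or TTG) is not
    converted to M here; the gene in this record starts with ATG.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record.
print(translate("ATGGCCTTTA TGCTAAGTCC TTTGCTCAAA"))  # MAFMLSPLLK
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the Protein sequence and GC content fields of the record.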