Gene Information |
Locus tag | EcHS_A1619 |
Symbol | |
ID | 5593529 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1640616 |
End bp | 1641929 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640920767 |
Product | PTS system lactose/cellobiose family IIC subunit |
Protein accession | YP_001458323 |
Protein GI | 157161005 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1455] Phosphotransferase system cellobiose-specific component IIC |
TIGRFAM ID | [TIGR00359] phosphotransferase system, cellobiose specific, IIC component; [TIGR00410] PTS system, lactose/cellobiose family IIC component |
|
|
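The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1640616
end_bp = 1641929

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, codons, protein_length)
```

Under these assumptions the script reproduces the listed gene length of 1314 bp and protein length of 437 aa.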
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 0.711166 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCTCAT TCGAACGTGG AATGGAACGT TTTCTTGTTC CAGTTGCTAT CAAGTTAAAC TCACAAAAAC ATGTTGCAGC GGTGAGAGAT GGATTCGTTT TTACGTTTCC AATTATCATG GCAAGTTCAT TAATTATATT AATTAACTTT GCCATATTAT CGCCAGACGG TTTTATTGCC GGATTACTGC ATCTGAATAG CATTTTCCCC AACCTTGAAA AAGCACAAGC TATTTTTACT CCGGTAATGA ATGGTTCTGT AAATATCATG TCAATTATGA TTGCTTTCCT GGTCGCCAGG AATATGGCGA TTAGCTATGA GCAAGATGAT CTTTTATGCG GATTAACGGC AATAGGAGCA TTTTTTATTG TTTATACGCC ATATCAGATG ATAGATGGGC AAGCATTCCT GACGACCAAA TATCTCGGCG CGCAGGGGTT GTTTGTTGCT GTTATCGTTG CATTGATCAC CAGTGAAATA TTTTGTCGCT TAGCTCGAAA CCCAAAAATC ACCATAACGA TGCCGGCAGC TGTACCGCCT GCGGTAGCGC GTTCATTTAA AGTTTTATTG CCAATATTTT TTGTCATGGT GTTCTTTTCC GCACTTAATT ATTGCCTGAC ACTGATATCC CCGGCAGGAC TAAACGATCT TATTTACACA TTAATCCAGA CGCCGCTCAA ACATATGGGA ACGAATATCT TTGCGGTAAT TATCCTGGGG GCTGTGGGTA ATTTCCTGTG GGTGCTGGGG ATCCACGGAC CTAATACCAC ATCGGCAATT CGAGAAACCG TTTTTTCTGA GGCTAATCTG GAGAATCTCT CCTGGGCCGC TCAACATGGC ACTACCTGGG GCGCGCCATA TCCGATTACC TGGACTTCTA TTAATGATGC ATTCGCCAAC TGCGGCGGTT CAGGTATGAC GTTGGGGTTA TTGTTGGCTA TTTTTATCGC TTCTAAGCGT GCGGAATACC GTGATCTGGC AAAAATGTCA TTTATCCCCG GTATTTTCAA TATCAATGAA CCGATAATGT TTGGCCTTCC TATTGTACTT AACCCCATCA TGATGGTGCC GTTTATTATG GTTCCCATTG TTAACTGTGC CATTGGTTAC TTCTTTGTTT CGATGGAAAT TATTTCACCG GTTGCTTATG CCGTGCCCTG GACTACGCCC GGACCTTTAA TTGCTTTCCT CGGAACCGGG GGGAACTGGC TGGCTTTACT GGTTGGTTTT TTATGTTTAG GTGTGGCGAC AATGATCTAT TTACCTTTTG TTATTGCCGC CAACAAAGTC AATAACATGA CAACTAACGG ATAA
|
Protein sequence | MASFERGMER FLVPVAIKLN SQKHVAAVRD GFVFTFPIIM ASSLIILINF AILSPDGFIA GLLHLNSIFP NLEKAQAIFT PVMNGSVNIM SIMIAFLVAR NMAISYEQDD LLCGLTAIGA FFIVYTPYQM IDGQAFLTTK YLGAQGLFVA VIVALITSEI FCRLARNPKI TITMPAAVPP AVARSFKVLL PIFFVMVFFS ALNYCLTLIS PAGLNDLIYT LIQTPLKHMG TNIFAVIILG AVGNFLWVLG IHGPNTTSAI RETVFSEANL ENLSWAAQHG TTWGAPYPIT WTSINDAFAN CGGSGMTLGL LLAIFIASKR AEYRDLAKMS FIPGIFNINE PIMFGLPIVL NPIMMVPFIM VPIVNCAIGY FFVSMEIISP VAYAVPWTTP GPLIAFLGTG GNWLALLVGF LCLGVATMIY LPFVIAANKV NNMTTNG
|
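The gene and protein sequences can be cross-checked with a short script. This is a sketch under stated assumptions, not the database's own method: it uses only the Python standard library, the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and it assumes the gene sequence is already shown in coding orientation (it begins with ATG and ends with TAA even though the strand is `-`). Translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are permitted, so the standard codon layout is used here.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# The first seven codons of the gene sequence above.
print(translate("ATGGCCTCAT TCGAACGTGG A"))
```

To check the full record, pass the complete gene sequence to `translate` and compare the result with `clean` applied to the protein sequence; `gc_percent` on the same input can be compared with the GC content field.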