Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1076 |
Symbol | |
ID | 5591601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1090393 |
End bp | 1091583 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640920241 |
Product | hypothetical protein |
Protein accession | YP_001457806 |
Protein GI | 157160488 |
COG category | [R] General function prediction only |
COG ID | [COG1092] Predicted SAM-dependent methyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.00716941 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGTAC GTTTAGTGTT AGCCAAAGGG CGCGAAAAAT CATTACTTCG TCGCCATCCG TGGGTCTTTT CCGGGGCCGT TGCCCGCATG GAAGGTAAAG CCAGCCTCGG TGAAACCATC GATATTGTTG ATCATCAGGG AAAATGGTTA GCACGCGGCG CTTATTCGCC AGCTTCGCAA ATCCGGGCGC GCGTCTGGAC GTTTGACCCG TCTGAGTCTA TCGACATTGC TTTTTTTTCC CGCCGTTTGC AACAAGCACA AAAATGGCGT GACTGGCTGG CGCAAAAAGA TGGCCTCGAC AGCTATCGTT TAATCGCCGG AGAATCTGAT GGCCTGCCGG GTATTACTAT CGATCGTTTC GGTAATTTTC TGGTGCTGCA ACTGCTGAGT GCTGGGGCAG AATATCAGCG CGCGGCATTA ATTAGTGCCC TGCAAACGCT GTACCCGGAA TGTTCGATTT ACGATCGCAG CGACGTCGCG GTACGTAAAA AAGAAGGAAT GGAGCTGACC CAGGGCCCCG TCACCGGCGA GTTGCCACCT GCCCTGCTGC CGATTGAAGA ACACGGAATG AAACTGCTGG TGGATATTCA GCACGGACAC AAAACGGGCT ACTACCTGGA CCAGCGTGAT AGCCGCCTGG CTACCCGCCG CTACGTTGAA AATAAACGTG TGCTGAACTG TTTCTCCTAT ACCGGTGGTT TCGCCGTATC GGCACTGATG GGCGGTTGCA GCCAGGTTGT CAGCGTTGAT ACCTCCCAGG AAGCGCTGGA TATTGCACGG CAGAACGTTG AGCTGAACAA ACTGGATCTG AGCAAGGCTG AGTTTGTCCG TGATGATGTC TTTAAATTGC TGCGTACTTA TCGCGATCGC GGTGAAAAAT TTGACGTTAT CGTGATGGAC CCGCCGAAGT TTGTTGAGAA TAAAAGCCAG TTGATGGGCG CGTGTCGGGG CTATAAAGAT ATCAACATGC TGGCGATTCA GTTGCTGAAT GAAGGCGGTA TTCTCCTGAC TTTCTCCTGT TCCGGTCTGA TGACCAGCGA TTTATTTCAG AAAATCATCG CGGATGCCGC AATTGATGCC GGCCGTGATG TACAATTTAT AGAGCAGTTC CGTCAGGCAG CCGATCATCC GGTGATCGCT ACCTACCCGG AAGGGCTATA TCTGAAAGGG TTTGCCTGTC GCGTCATGTA A
|
Protein sequence | MSVRLVLAKG REKSLLRRHP WVFSGAVARM EGKASLGETI DIVDHQGKWL ARGAYSPASQ IRARVWTFDP SESIDIAFFS RRLQQAQKWR DWLAQKDGLD SYRLIAGESD GLPGITIDRF GNFLVLQLLS AGAEYQRAAL ISALQTLYPE CSIYDRSDVA VRKKEGMELT QGPVTGELPP ALLPIEEHGM KLLVDIQHGH KTGYYLDQRD SRLATRRYVE NKRVLNCFSY TGGFAVSALM GGCSQVVSVD TSQEALDIAR QNVELNKLDL SKAEFVRDDV FKLLRTYRDR GEKFDVIVMD PPKFVENKSQ LMGACRGYKD INMLAIQLLN EGGILLTFSC SGLMTSDLFQ KIIADAAIDA GRDVQFIEQF RQAADHPVIA TYPEGLYLKG FACRVM
|
| |