Gene Information |
Locus tag | EcHS_A3421 |
Symbol | |
ID | 5591438 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3422458 |
End bp | 3423585 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640922540 |
Product | AFG1 family ATPase |
Protein accession | YP_001460028 |
Protein GI | 157162710 |
COG category | [R] General function prediction only |
COG ID | [COG1485] Predicted ATPase |
TIGRFAM ID | |
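The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced here for illustration; they are not part of any database tool.

```python
# Values copied from the Gene Information table above.
START_BP = 3422458
END_BP = 3423585
GENE_LENGTH_BP = 1128
PROTEIN_LENGTH_AA = 375


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Both checks hold for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```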
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000000000000090092 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAGCG TTACCCCAAC ATCGCAATAC CTGAAGGCGC TCAATGAAGG CAGCCATCAA CCCGACGACG TTCAAAAAGA GGCCGTCAGC CGCCTGGAAA TTATTTATCA AGAACTCATC AATAGCAGGC CACCAGCCCC CAGGACGAGT GGGCTAATGG CGCGGGTCGG TAAGCTGTGG GGTAAACGCG AAGACACAAA GCATACGCCA GTGCGTGGCT TATATATGTG GGGCGGTGTA GGACGCGGGA AAACCTGGCT GATGGACCTT TTCTATCAAA GCCTGCCGGG AGAGCGGAAA CAGCGCCTGC ACTTTCACCG TTTTATGCTG CGGGTGCACG AAGAGCTAAC TGCCTTACAG GGGCAGACCG ATCCGCTGGA AATTATTGCC GATCGCTTTA AAGCCGAAAC TGACGTGCTC TGTTTTGACG AATTTTTTGT TTCTGATATT ACCGACGCCA TGCTACTTGG CGGTCTGATG AAAGCCCTGT TCACCCGCGG TATTACCCTG GTAGCGACGT CAAATATTCC GCCGGACGAA CTTTATCGAA ATGGCCTGCA ACGTGCGCGT TTTCTGCCTG CAATCGATGC CATTAAACAG CATTGTGATG TAATGAACGT GGACGCTGGT GTTGATTATC GACTGCGTAC ACTCACTCAG GCGCATCTGT GGCTTTCGCC ACTCAACGAT GAAACCCGGG CGCAGATGGA TAAACTATGG TTGGCGCTGG CGGGGGCGAA ACGAGAAAAT TCACCGACGT TAGAAATCAA CCATCGGCCA TTGGCGACAA TGGGCGTCGA GAACCAGACG CTGGCGGTCT CTTTTACTAC GCTGTGCGTC GACGCCCGCA GTCAGCATGA CTATATTGCG CTCTCACGTC TCTTTCATAC GGTCATGTTG TTTGATGTAC CAGTTATGAC GCGGTTGATG GAGAGCGAAG CGCGGCGCTT TATTGCGCTG GTGGATGAGT TTTACGAGCG CCATGTCAAA TTAGTGGTGA GTGCAGAAGT GCCGCTGTAT GCAATTTATC AGGGCGAGCG GCTGAAATTT GAGTTCCAGC GTTGCCTGTC ACGTCTGCAA GAGATGCAAA GCGAAGAGTA TCTGAAGCGC GAGCATTTAG CAGGTTAA
Protein sequence | MQSVTPTSQY LKALNEGSHQ PDDVQKEAVS RLEIIYQELI NSRPPAPRTS GLMARVGKLW GKREDTKHTP VRGLYMWGGV GRGKTWLMDL FYQSLPGERK QRLHFHRFML RVHEELTALQ GQTDPLEIIA DRFKAETDVL CFDEFFVSDI TDAMLLGGLM KALFTRGITL VATSNIPPDE LYRNGLQRAR FLPAIDAIKQ HCDVMNVDAG VDYRLRTLTQ AHLWLSPLND ETRAQMDKLW LALAGAKREN SPTLEINHRP LATMGVENQT LAVSFTTLCV DARSQHDYIA LSRLFHTVML FDVPVMTRLM ESEARRFIAL VDEFYERHVK LVVSAEVPLY AIYQGERLKF EFQRCLSRLQ EMQSEEYLKR EHLAG
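The sequences above are printed in space-separated blocks of ten characters, so they need to be normalized before any calculation. The following is a minimal sketch of how a GC percentage like the one in the GC content field could be recomputed from the Gene sequence row. The helper names `clean_sequence` and `gc_percent` are hypothetical, and how the database rounds its reported value is an assumption not confirmed by this record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spacing between 10-character blocks and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# Toy input for illustration only, not taken from the record:
# 6 of the 8 bases are G or C.
print(gc_percent("ATGC GGCC"))  # 75.0
```

To apply it to this record, pass the full text of the Gene sequence row to `gc_percent` and compare the result with the GC content field.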