Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3528 |
Symbol | |
ID | 6145973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3605795 |
End bp | 3606922 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641618357 |
Product | AFG1 family ATPase |
Protein accession | YP_001745504 |
Protein GI | 170683950 |
COG category | [R] General function prediction only |
COG ID | [COG1485] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000142323 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAGCG TTACCCCAAC ATCGCAATAC CTGAAGGCGC TCAATGAAGG CAGCCATCAA CCCGACGACG TTCAAAAAGA GGCCGTCAGC CGCCTGGAAA TTATTTATCA GGAACTCATC AATAGCACGC CTCCAGCCCC TAGGACGAGT GGGCTAATGG CGCGGGTCGG TAAGCTGTGG AGTAAACGCG AAGACACAAA GCATACGCCA GTGCGTGGCT TATATATGTG GGGCGGTGTA GGACGCGGGA AAACCTGGCT GATGGACCTT TTCTATCAAA GCCTGCCGGG AGAGCGGAAA CAGCGTCTGC ACTTTCACCG TTTTATGCTG CGGGTGCACG AAGAGCTGAC CGAATTACAA GGCCAAAGCG ATCCGCTGGA AATTATTGCC GATCGCTTTA AAGCAGAAAC TGACGTGCTC TGTTTTGACG AATTTTTTGT TTCTGATATT ACCGATGCCA TGCTACTTGG CGGTCTGATG AAAGCCCTGT TCGCCCGAGG TATTACCCTG GTAGCGACGT CAAATATTCC GCCGGACGAA CTTTATCGAA ATGGCCTACA ACGTGCGCGT TTTTTGCCTG CAATCGATGC CATTAAACAG CATTGTGATG TAATGAACGT GGACGCTGGT GTTGATTATC GACTGCGTAC ACTCACTCAG GCGCATCTGT GGCTTTCGCC CCTCAACGAT GAAACCCGGG CGCAAATGGA TAAACTATGG TTGGCGCTGG CGGGGGCGAA ACGAGAAAAT TCACCGACAT TAGAAATCAA CCATCGGCCA TTGGCGACAA TGGGCGTCGA GAACCAGACG CTGGCGGTCT CTTTTACTAC GCTGTGCGTC GACGCCCGCA GTCAGCATGA CTATATTGCG CTCTCACGCC TCTTTCACAC GGTCATGTTG TTTGATGTAC CAGTTATGAC GCGGTTGATG GAGAGCGAAG CGCGGCGCTT TATTGCGCTG GTGGATGAGT TTTACGAGCG CCATGTCAAA TTAGTGGTGA GTGCAGAAGT GCCGCTATAT GACATTTATC AGGGCGAGCG GCTGAAATTT GAGTTCCAGC GTTGCCTGTC ACGTCTGCAA GAGATGCAAA GCGAAGAGTA TCTGAAGCGC GAGCATTTAG CGGGTTAA
|
Protein sequence | MQSVTPTSQY LKALNEGSHQ PDDVQKEAVS RLEIIYQELI NSTPPAPRTS GLMARVGKLW SKREDTKHTP VRGLYMWGGV GRGKTWLMDL FYQSLPGERK QRLHFHRFML RVHEELTELQ GQSDPLEIIA DRFKAETDVL CFDEFFVSDI TDAMLLGGLM KALFARGITL VATSNIPPDE LYRNGLQRAR FLPAIDAIKQ HCDVMNVDAG VDYRLRTLTQ AHLWLSPLND ETRAQMDKLW LALAGAKREN SPTLEINHRP LATMGVENQT LAVSFTTLCV DARSQHDYIA LSRLFHTVML FDVPVMTRLM ESEARRFIAL VDEFYERHVK LVVSAEVPLY DIYQGERLKF EFQRCLSRLQ EMQSEEYLKR EHLAG
|
| |