Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4054 |
Symbol | |
ID | 6145230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4144827 |
End bp | 4146041 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641618879 |
Product | hypothetical protein |
Protein accession | YP_001746017 |
Protein GI | 170681741 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACAGA TAACCTTTGC TCCCCGTAAT CACCTGCTCA CCAATACCAA TACCTGGACG CCCGACAGCC AGTGGCTGGT ATTTGACGTG CGTCCTTCTG GCGCGTCGTT TACCGGCGAG ACCATTGAGC GTGTGAATAT CCATACCGGC GAGGTCGAGG TTATCTATCG CGCGTCACAG GGCGCGTACG TCGGCGTGGT GACCGTTCAT CCGAAGTCAG AGAAATATGT TTTCATCCAC GGCCCGGAAA ATCCTGATGA AACATGGTAT TACGACTTCC ATCATCGGCG TGGAGTGATT GTTGAAGGCG GCAAGGTGAG CAATCTCGAT GCAATGGATA TTACCGCGCC GTACACCCCA GGAGCACTGC GCGGCGGCAG CCATGTGCAT GTCTTTAGCC CGAACGGTGA AAGGGTGAGC TTTACCTATA ACGACCATGT AATGCATCAA CTCGATTCGG CGCTGGATTT GCGAAACGTC GGCGTGGCTG CACCGTTTGG CCCGGTCAAC GTACAAAAGC AGCATCCGCG TGAATACAGC GGTAGCCACT GGTGCGTGCT GGTGAGTAAA ACCACGCCCA CGCCGCAGCC TGGCAGCGAT GAAATCAATC GTGCTTATGA AGAAGGATGG GTAGGAAATC ACGCGCTGGC GTTTATTGGC GACACACTTT CGCCAAAGGG CGAGAAAGTG CCGGAGCTAT TTATCGTTGA GTTACCGCAA GATGAAGCTG GCTGGAAAGC GGCAGGTGAT GCGCCGTTAA GTGGAACGGA AACAACCCTG CCCGCGCCAC CGCGTGGCGT CGTGCAGCGA CGTTTAACCT TTACCCACCA TCGGGCTTAT CCGGGGTTAG TCAACGTCCC GCGCCACTGG GTGCGCTGTA ATCCGCAGGG TACGCAAATC GCGTTTTTAA TGCGTGATGA TAACGGCATT GTGCAACTGT GGCTTATCTC GCCACAGGGC GGCGAGCCGC GCCAGTTAAC CCATAACAAA ACGGATATTC AGTCTGCATT TAACTGGCAT CCGTCAGGAG AATGGTTGGG CTTTGTGCTG GATAATCGAA TTGCTTGCGC CCATGCGCAA AGCGGCGAGG TTGAGTATTT AACCGAAAAC CACGCCAATC CACCTTCTGC GGACGCCGTG GTCTTCTCGC CGGATGGTCA ATGGCTGGCG TGGATGGAAG GTGGCCAGCT GTGGATCACC GAAACTGATC GCTAA
|
Protein sequence | MKQITFAPRN HLLTNTNTWT PDSQWLVFDV RPSGASFTGE TIERVNIHTG EVEVIYRASQ GAYVGVVTVH PKSEKYVFIH GPENPDETWY YDFHHRRGVI VEGGKVSNLD AMDITAPYTP GALRGGSHVH VFSPNGERVS FTYNDHVMHQ LDSALDLRNV GVAAPFGPVN VQKQHPREYS GSHWCVLVSK TTPTPQPGSD EINRAYEEGW VGNHALAFIG DTLSPKGEKV PELFIVELPQ DEAGWKAAGD APLSGTETTL PAPPRGVVQR RLTFTHHRAY PGLVNVPRHW VRCNPQGTQI AFLMRDDNGI VQLWLISPQG GEPRQLTHNK TDIQSAFNWH PSGEWLGFVL DNRIACAHAQ SGEVEYLTEN HANPPSADAV VFSPDGQWLA WMEGGQLWIT ETDR
|
| |