Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2325 |
Symbol | |
ID | 6145387 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2356384 |
End bp | 2357940 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641617199 |
Product | hypothetical protein |
Protein accession | YP_001744372 |
Protein GI | 170681845 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2200] FOG: EAL domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.802767 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.00255877 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTCATAC GCGCTCCCAA TTCTGGACGT AAGCTCCTGC TTACCTGCAT TGTTGCAGGC GTGATGATTG CGATACTGGT GAGCTGCCTT CAGTTTTTAG TGGCCTGGCA TAAGCACGAA GTCAAATACG ACACACTGAT TACCGACGTA CAAAAGTATC TCGATACCTA TTTTGCCGAC CTGAAATCCA CTACTGACCG GCTTCAGCCG CTGACCTTAG ATACCTGCCA GCAGGCTAAC CCCGAACTGA CCGCTCGCGC GGCGTTTAGC ATGAATGTCC GTACGTTTGT GCTGGTGAAA GATAAAAAAA CATTCTGTTC ATCTGCGACT GGTGAGATGG ACATTCCACT AAAAGAATTG ATTCCGGCGC TCGACATTAA TAAAAATGTC GATATGGCGA TCTTACCCGG TACGCCGATG GTGCCGAACA AACCCGCAAT CGTCATCTGG TATCGCAACC CTTTGCTGAA AAATAGCGGC GTCTTTGCCG CTCTGAATCT CAACCTGACG CCTTCACTCT TTTATAGTTC ACGGCAGGAA GATTACGATG GCCTCGCCCT CATTATTGGT AATACTGCGC TATCTACCTT TTCTTCACGT TTGATGAATG TTAATGAATT AACCGACATG CCGGTCCGTG AAACTAAAAT TGCGGGCATT CCTCTGACCG TTCGGCTTTA TGCGGATGAC TGGACATGGA ACGATGTGTG GTACGCATTT TTACTGGGTG GCATGAGTGG AACTTTCGTT GGACTTCTCT GCTATTACCT GATGAGTGTG CGTATGCGCC CAGGCAGAGA AATCATGACC GCCATCAAGC GCGAACAATT TTACGTGGTA TATCAACCGG TGGTTGATAC ACAAGCTTTG CGGGTAACGG GCCTGGAAGT ACTGCTACGC TGGCGGCATC CAGTAGCAGG AGAAATCCCC CCGGATGCCT TCATTAACTT TGCCGAAGCG CAAAAGATGA TTGTGCCACT GACTCAGCAC CTGTTTGAGT TGATTGCCCG CGATGCCGCA GAATTAGAAA AAGTACTGCC GGTAGGCGTC AAATTTGGCA TTAACATTGC GCCGGCCCAC TTGCACAGCG AAAGCTTTAA AGCGGATATC CAGAAACTGC TCACTTCCCT GCCCGCACAC CATTTCCAGA TTGTGCTGGA AATTACCGAG CGCGATATGC TGAAAGAGCG AGAAGCCACA CAACTCTTCG CCTGGCTGCA TTCGGTCGGC GTAGAAATTG CTATTGATGA CTTCGGCACC GGGCACAGCG CGCTTATCTA TCTTGAGCGT TTTACGCTCG ATTATCTGAA AATTGATCGT GGATTTATCA ACGCCATCGG TACGGAAACG ATCACTTCAC CCGTACTTGA CGCGGTGCTG ACGCTGGCGA AACGTCTCAA TATGCTGACA GTTGCTGAAG GGGTCGAAAC GCCAGAACAG GCACGATGGC TAAGCGAACG CGGCGTTAAT TTCATGCAAG GCTACTGGAT TAGTCGCCCG TTACCGCTGG ACGATTTTGT TCGCTGGCTG AAGAAACCGT ATACGCCGCA GTGGTAA
|
Protein sequence | MFIRAPNSGR KLLLTCIVAG VMIAILVSCL QFLVAWHKHE VKYDTLITDV QKYLDTYFAD LKSTTDRLQP LTLDTCQQAN PELTARAAFS MNVRTFVLVK DKKTFCSSAT GEMDIPLKEL IPALDINKNV DMAILPGTPM VPNKPAIVIW YRNPLLKNSG VFAALNLNLT PSLFYSSRQE DYDGLALIIG NTALSTFSSR LMNVNELTDM PVRETKIAGI PLTVRLYADD WTWNDVWYAF LLGGMSGTFV GLLCYYLMSV RMRPGREIMT AIKREQFYVV YQPVVDTQAL RVTGLEVLLR WRHPVAGEIP PDAFINFAEA QKMIVPLTQH LFELIARDAA ELEKVLPVGV KFGINIAPAH LHSESFKADI QKLLTSLPAH HFQIVLEITE RDMLKEREAT QLFAWLHSVG VEIAIDDFGT GHSALIYLER FTLDYLKIDR GFINAIGTET ITSPVLDAVL TLAKRLNMLT VAEGVETPEQ ARWLSERGVN FMQGYWISRP LPLDDFVRWL KKPYTPQW
|
| |