Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4387 |
Symbol | |
ID | 6144833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4475606 |
End bp | 4476733 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641619208 |
Product | hypothetical protein |
Protein accession | YP_001746332 |
Protein GI | 170683638 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAAATC TGTTTGCGAT TAATGTCTGT AAAACGTTCG GTTGTCGAAA TTTGGGATTG GCCTCATCGG AAGATTACAG TTGGCCAGAC TATAAACTGG GTTTTCCGGC ACTGCACTGC CAGGCATGCG GAAGTTACCC CCCTTTGTTT GATGAACAAC AATTTCGTGA CTGGTTGTCA GTTCACATGA CTGCGTGTGC CATAGAAACA GGGCATTTTT GTCCATCATG CTATTGTAAA GAAAAGATTT TTTATGGCCA CAACCCGCAG GGAACACAAC GCGTTCAATG CCGTTCATGC AAAAAAGTCT GGACTCCGAA ACAGCAGCCA GCAAGAGAAA TTACGTATCC CCAGTCCATT GAAACCGTTC AACTGTTCAT GCCCTTTCAA GGAGCCAGCG CAGTACAAAA GCTCTATGTT TTAGTCAGTC TTGACGCCAC TCGTGGCAAC ATCCTTCATC TCTCCACGAA TTATACGCAA CATCAGACAG GAGACAGTCT GCGGTACAGT TACAGAGGCA ATACAGAACC AACTGAGCAC CATAGCGATA TTGTGCAGAG GGTAGATATG CGTGAAGCGC AATTCTTGCG CCGGAGCCAG TTCGATGAAA TTCAGTATGG CAGTGCTGTG CTCAAGCGCA ACGGCAAGGG AGCCATACTA CGCCCGGTTA TCACGGCACA CGGGCATTTC AGAATACTGA AAATCCGCTT TCCACATGTC AAAACGCATA TCATTTCACA CGAATGTTTT CTGAGAGGCG CAATTATTAC AGCCTGGGCA GATCAGTTCC GCCAACAACA AGGCGAACTT TGGTTCGTAG AAGAAGAAAT CAGCGACCGT AACGCTGATA TTCCCTGGCA TTTTCAGGGA ACGACATACC ATGGTTGGTG GCAAAATCAG TGGCAACGCT GGGGGCAGGG GAATAACAGC AAGATGGTCT GCCTGCTCAC AGGAGTCTCC TTAGAAAGGG GCGCAAATGT TTCTCTGGCA ACCAGTCGTT GCTTTATCAC ATGGCTGACA GACCAACACG ACTTTACCCA AAGCGCGTTA TTATCCGCAG GTCGCGTAAC GAAAATGCTG ACCTCACTGG CGTTAAAATA CAATGAATCG CTCACTCCAT CTTGTTAG
|
Protein sequence | MSNLFAINVC KTFGCRNLGL ASSEDYSWPD YKLGFPALHC QACGSYPPLF DEQQFRDWLS VHMTACAIET GHFCPSCYCK EKIFYGHNPQ GTQRVQCRSC KKVWTPKQQP AREITYPQSI ETVQLFMPFQ GASAVQKLYV LVSLDATRGN ILHLSTNYTQ HQTGDSLRYS YRGNTEPTEH HSDIVQRVDM REAQFLRRSQ FDEIQYGSAV LKRNGKGAIL RPVITAHGHF RILKIRFPHV KTHIISHECF LRGAIITAWA DQFRQQQGEL WFVEEEISDR NADIPWHFQG TTYHGWWQNQ WQRWGQGNNS KMVCLLTGVS LERGANVSLA TSRCFITWLT DQHDFTQSAL LSAGRVTKML TSLALKYNES LTPSC
|
| |