Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0599 |
Symbol | |
ID | 6144056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 608736 |
End bp | 609854 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615491 |
Product | carboxylate-amine ligase |
Protein accession | YP_001742697 |
Protein GI | 170683850 |
COG category | [S] Function unknown |
COG ID | [COG2170] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02050] uncharacterized enzyme |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCATTAC CCGATTTTCA TGTTTCTGAA CCTTTTACCC TCGGTATTGA ACTGGAAATG CAGGTGGTTA ATCCGCCGGG CTATGACTTA AGCCAGGACT CTTCAATGCT GATTGACGCA GTTAAAAATA AGATCACGGC CGGAGAGGTA AAGCACGATA TCACCGAAAG TATGCTGGAG CTGGCGACGG ATGTTTGCCG TGATATCAAC CAGGCTGCCG GGCAATTTTC AGCGATGCAG AAAGTCGTAT TGCAGGCAGC CGCAGATCAT CATCTGGAAA TTTGCGGCGG TGGCACGCAC CCGTTTCAGA AATGGCAGCG TCAGGAGGTA TGCGATAACG AACGCTATCA ACGAACGCTG GAAAACTTTG GTTATCTCAT TCAACAGGCG ACCGTTTTTG GTCAGCATGT CCATGTTGGT TGCGCCAGTG GCGATGATGC CATTTATTTG CTGCACGGCC TGTCACGGTT TGTGCCGCAC TTTATCGCCC TTTCCGCCGC GTCGCCATAT ATGCAAGGAA CGGATACGCG TTTTGCCTCC TCACGACCGA ATATTTTTTC CGCCTTTCCC GATAATGGCC CGATGCCGTG GGTCAGTAAC TGGCAACAAT TTGAAGCCCT GTTTCGCTGC CTGAGTTACA CCACGATGAT CGACAGCATT AAAGATCTGC ACTGGGATAT TCGCCCCAGT CCTCATTTTG GCACGGTGGA GGTTCGGGTA ATGGATACCC CGTTAACCCT TAGCCATGCG GTAAATATGG CAGGATTGAT TCAGGCTACC GCCCACTGGT TACTGACGGA ACGCCCGTTT AAACATCAGG AGAAAGATTA CCTGCTGTAT AAATTCAACC GTTTCCAGGC CTGCCGCTAT GGGCTGGCAG GTGTCATTAC CGATCCGCAC ACTGGCGATC GTCGACCGCT AACGGAAGAC ACCTTGCGAT TGCTGGAAAA AATCGCCCCT TCTGCAAATA AAATTGGCGC ATCGAGCGCA ATTGAAGCCC TGCATCGCCA GGTCGTCAGC GGTCTGAATG AAGCGCAGCT GATGCGCGAT TTCGTCGCCG ATGGCGGCTC GCTGATTGGG CTGGTGAAAA AGCATTGTGA GATCTGGGCC GGTGACTAA
|
Protein sequence | MPLPDFHVSE PFTLGIELEM QVVNPPGYDL SQDSSMLIDA VKNKITAGEV KHDITESMLE LATDVCRDIN QAAGQFSAMQ KVVLQAAADH HLEICGGGTH PFQKWQRQEV CDNERYQRTL ENFGYLIQQA TVFGQHVHVG CASGDDAIYL LHGLSRFVPH FIALSAASPY MQGTDTRFAS SRPNIFSAFP DNGPMPWVSN WQQFEALFRC LSYTTMIDSI KDLHWDIRPS PHFGTVEVRV MDTPLTLSHA VNMAGLIQAT AHWLLTERPF KHQEKDYLLY KFNRFQACRY GLAGVITDPH TGDRRPLTED TLRLLEKIAP SANKIGASSA IEALHRQVVS GLNEAQLMRD FVADGGSLIG LVKKHCEIWA GD
|
| |