Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3065 |
Symbol | |
ID | 6066162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 3349853 |
End bp | 3350971 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641602481 |
Product | carboxylate-amine ligase |
Protein accession | YP_001726016 |
Protein GI | 170021062 |
COG category | [S] Function unknown |
COG ID | [COG2170] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02050] uncharacterized enzyme |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.202744 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCATTAC CCGATTTTCA TGTTTCTGAA CCTTTTACCC TCGGTATTGA ACTGGAAATG CAGGTGGTTA ATCCGCCGGG CTATGACTTA AGCCAGGACT CTTCAATGCT GATTGACGCG GTTAAAAATA AGATCACGGC CGGAGAGGTA AAGCACGATA TCACCGAAAG TATGCTGGAG CTGGCGACGG ATGTTTGCCG TGATATCAAC CAGGCTGCCG GGCAGTTTTC AGCGATGCAG AAAGTCGTAT TGCAGGCAGC CACAGACCAT CATCTGGAAA TTTGCGGCGG TGGCACGCAC CCGTTTCAGA AATGGCAGCG TCAGGAGGTA TGCGATAACG AACGCTATCA ACGCACGCTG GAAAACTTTG GTTATCTCAT TCAGCAGGCG ACCGTTTTTG GTCAGCATGT CCATGTTGGC TGCGCCAGTG GCGATGACGC CATTTATTTG CTGCACGGCT TGTCACGATT TGTGCCGCAC TTTATCGCCC TTTCCGCCGC GTCGCCATAT ATGCAGGGAA CGGATACGCG TTTTGCCTCC TCACGACCGA ATATTTTTTC CGCCTTTCCT GATAATGGCC CGATGCCGTG GGTCAGTAAC TGGCAACAAT TTGAAGCCCT ATTTCGCTGT TTGAGTTACA CCACGATGAT CGACAGCATT AAAGATCTGC ACTGGGATAT TCGCCCCAGT CCTCATTTTG GCACGGTGGA GGTTCGGGTG ATGGATACCC CGTTAACCCT TAGCCACGCG GTAAATATGG CGGGATTAAT TCAGGCCACC GCCCACTGGT TACTGACGGA ACGCCCGTTC AAACATCAGG AGAAAGATTA CCTGCTGTAT AAATTCAACC GTTTCCAGGC CTGTCGCTAT GGGCTGGAAG GCGTCATTAC CGATCCGCAC ACTGGTGATC GTCGACCGCT AACGGAAGAT ACCTTGCGAT TGCTGGAAAA AATCGCCCCT TCCGCACATA AAATTGGCGC ATCGAGCGCA ATTGAAGCAC TGCATCGCCA GGTCGTCAGC GGTCTGAATG AAGCGCAGCT GATGCGCGAT TTCGTCGCCG ATGGCGGCTC GCTGATTGGG CTGGTGAAAA AGCATTGTGA GATCTGGGCC GGTGACTAA
|
Protein sequence | MPLPDFHVSE PFTLGIELEM QVVNPPGYDL SQDSSMLIDA VKNKITAGEV KHDITESMLE LATDVCRDIN QAAGQFSAMQ KVVLQAATDH HLEICGGGTH PFQKWQRQEV CDNERYQRTL ENFGYLIQQA TVFGQHVHVG CASGDDAIYL LHGLSRFVPH FIALSAASPY MQGTDTRFAS SRPNIFSAFP DNGPMPWVSN WQQFEALFRC LSYTTMIDSI KDLHWDIRPS PHFGTVEVRV MDTPLTLSHA VNMAGLIQAT AHWLLTERPF KHQEKDYLLY KFNRFQACRY GLEGVITDPH TGDRRPLTED TLRLLEKIAP SAHKIGASSA IEALHRQVVS GLNEAQLMRD FVADGGSLIG LVKKHCEIWA GD
|
| |