Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4199 |
Symbol | |
ID | 6144006 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4299229 |
End bp | 4300185 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641619022 |
Product | carbamate kinase |
Protein accession | YP_001746150 |
Protein GI | 170681211 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0549] Carbamate kinase |
TIGRFAM ID | [TIGR00746] carbamate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.00169191 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTTAAGC CACTGGCTGT CGTCGCGGTT GGCGGCAATG CGCTCATTCA GGACGAGCAA CGCAATAGTA TTCCCGATCA ATATGTTGCA GTGATGGAAA GCGTGCAGCA TATCGTTGAT ATGGTTGAAG CCGGATGGGA CCTGGTGCTA ACCCACGGTA ATGGCCCGCA GGTGGGCTTT ATTCTGCGCC GCTCTGAACT CGCCAGTAAC GAAGTTTCTC CGGTTCCACT TGATTACGCC GTGGGTGATA CACAAGGTGC AATTGGCTAC ATGTTCCAGA AAGCACTGCA TAACGAATTG GCTCGCCGTG GCATAAACAA ACCGGTAATT GCCCTGGTGA CACAAACGCG AGTCAGCCCA CATGACGATG CTTTCGCCAG CCCCAGTAAA CCAATTGGCG CGTTTCTCGA TGAAGCAACG GCCCAACAAC GCCAACAACA ACTCGGCTGG ACGCTGATGG AGGACGCCGG GCGTGGTTGG CGGCGTACAG TTCCCTCTCC TGCCCCACTG GAAGTTATTG AGCACGACAC CATCGCTCAC CTGGTGCGCC AGGGATATCT GGTTATTGCC TGCGGCGGCG GCGGTATTCC GGTGGTGCGA GACGGACAAC AACTGAAAGG TGTAGAAGCC GTGATCGATA AAGATCTGGC CTCCGCGCTG CTCGCCAGTC AGTTAGGCGC AGATCTGCTG GTGATCCCCA CCGGTGTAGA AAAAGTGGCG ATTAACTTTG GTACACCACA ACAACAGTGG CTCGACGCTA TCAGCGTTGC CGAAGCGCAA ACGCTGTTGC GGGAAGGTCA GTTTGGTGTC GGCAGTATGC AACCCAAAGT GGAAGCCATT GTTGATTTCA TCAATGCCAG CCAGCAACAA GGCAAACAGG CCAGCGGCCT GATTACTTCA CCGCAAACCA TAAAAGCAGC CCTGGCGCAT CAGAGCGGTA CATGGATAAC CCTTTAA
|
Protein sequence | MVKPLAVVAV GGNALIQDEQ RNSIPDQYVA VMESVQHIVD MVEAGWDLVL THGNGPQVGF ILRRSELASN EVSPVPLDYA VGDTQGAIGY MFQKALHNEL ARRGINKPVI ALVTQTRVSP HDDAFASPSK PIGAFLDEAT AQQRQQQLGW TLMEDAGRGW RRTVPSPAPL EVIEHDTIAH LVRQGYLVIA CGGGGIPVVR DGQQLKGVEA VIDKDLASAL LASQLGADLL VIPTGVEKVA INFGTPQQQW LDAISVAEAQ TLLREGQFGV GSMQPKVEAI VDFINASQQQ GKQASGLITS PQTIKAALAH QSGTWITL
|
| |