Gene Information |
Locus tag | Cphamn1_2049 |
Symbol | |
ID | 6375742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 2211293 |
End bp | 2212546 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642684540 |
Product | hypothetical protein |
Protein accession | YP_001960440 |
Protein GI | 189500970 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) |
TIGRFAM ID | |
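The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2211293, 2212546)
print(length, protein_length_aa(length))  # 1254 417
```

Under these assumptions the result agrees with the Gene Length (1254 bp) and Protein Length (417 aa) fields.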
Plasmid Coverage Information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0404776 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage Information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.28039 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCAGATAG TCATATTCGA AGACGAGAAA GTTCACGGGT TTCACCCGCT GGTGCATTTC AAGCCTGTTT ACGGCCTGTT TACCGGATGC AGGAATCTTT TGCAGAAGTT TTGTTTTTAC CTGGGCGCCG ATGTTACATT TTCCTGCCAT CTTCGCCGTT ATCTCCAACC CTATTACCGT TCTCATCTTC CGGTTTTTCA GCCCGGTGTC GATCCGCAGA GAGATATTCT GCTGGTAAAC GGTCGATTGT TGTGTGATGA GAAAGCGGCA ACCATCATAC GTGATCATCC CCCCGATCCC GGCCAGTGTC TGATGCAGGG AGACGAACTT GTCCTCGCGA GAGTCAATGA GGCCCGTATT GTGTCAGCTG ATAACATGCT TCCTGATTAT TTCGATACGC AACAGCTGGC AGCAGAAAGT GAGACCGTCG TTGCGGAGGG ATTCAGGTTG CTGCGAAATA TCTGGGACCC GGTCGCTTTT CATCCTGGAG AGCTGCATCG TGAAGCTTTT TCACTCGAGC TCGGAAGCAT CTCCGGCAGA GTCTCGTCAC GCGCGGGCCT GGAGAATCCG GAAAGTATAT TTATCGGTGA GGGAGCTGTG ATCAAGGCGG GAGCCCTGCT GGATGCTGAA GAGGGATTTG TGTATGTGAG TCCCGGCGCG GTCGTGGAGC CTCAGGTGGT TCTTGCAGGA AACGTTTTCG CGGGTGAGTT CTCCTGTGTC AGGACCGGAG CGAATCTGCA CAGCAATGTC TTTGTCGGCA GGGCGTCAAA AGCCGGGGGT GAGATAGAGG ATGCCGTTAT AGAGCCCTAT GCGAACAAAC AGCATGAGGG TTTTCTCGGT CACTCGTATA TCTCTTCGTG GTGTAATCTC GGGGCGGGAA CAAATACATC GGATTTGAGG AACAACTACG GCAAAGTAAA GCTACAGGTT GAAAATAAGG AGTTTCGCAC CGGTGAGCAG TTCCTCGGGC TTCTTATGGG AGAGCATACG AAGTGTTCTA TTAACTCGAT GTTCAATACC GGTACCGTCG CAGGCGCTTC TTCAAATATT TTCGGTGGCG GATTTCCTCC TAAATATATA CCTTCTTTTT CCTGGGGAGG GCCCGGATCG GGTTTTCAGC CCTATGAGAT AGAAAAAGCG GTTGCAACCG CACGTGTTGT TATGGGCCGC CGAAATATCA GGATGTGCGA TGCCTACGAG ACAATGTTCC GTTATGTCGC GGCTGTTGAA CAGGATAGTG GTACCGCTGT GTAG
Protein sequence | MQIVIFEDEK VHGFHPLVHF KPVYGLFTGC RNLLQKFCFY LGADVTFSCH LRRYLQPYYR SHLPVFQPGV DPQRDILLVN GRLLCDEKAA TIIRDHPPDP GQCLMQGDEL VLARVNEARI VSADNMLPDY FDTQQLAAES ETVVAEGFRL LRNIWDPVAF HPGELHREAF SLELGSISGR VSSRAGLENP ESIFIGEGAV IKAGALLDAE EGFVYVSPGA VVEPQVVLAG NVFAGEFSCV RTGANLHSNV FVGRASKAGG EIEDAVIEPY ANKQHEGFLG HSYISSWCNL GAGTNTSDLR NNYGKVKLQV ENKEFRTGEQ FLGLLMGEHT KCSINSMFNT GTVAGASSNI FGGGFPPKYI PSFSWGGPGS GFQPYEIEKA VATARVVMGR RNIRMCDAYE TMFRYVAAVE QDSGTAV
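The sequences in this record are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a field might be normalized before computing its length or GC fraction; the helper names are hypothetical, and the toy input is illustrative rather than taken from this record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Toy input in the same block layout as the Gene sequence field.
toy = "ATGC GGCC"
print(clean_sequence(toy), len(clean_sequence(toy)), gc_fraction(toy))  # ATGCGGCC 8 0.75
```

Applying the same helpers to the Gene sequence field would allow its length and GC content to be compared with the Gene Length and GC content fields.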