Gene Cphamn1_2049 details

Gene Information

Locus tag: Cphamn1_2049
Symbol:
ID: 6375742
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 2211293
End bp: 2212546
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 51%
IMG OID: 642684540
Product: hypothetical protein
Protein accession: YP_001960440
Protein GI: 189500970
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains)
TIGRFAM ID:
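
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2211293, 2212546)
print(length, protein_length(length))  # prints: 1254 417
```

Under these assumptions the computed values agree with the listed 1254 bp and 417 aa.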


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0404776
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.28039
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGATAG TCATATTCGA AGACGAGAAA GTTCACGGGT TTCACCCGCT GGTGCATTTC 
AAGCCTGTTT ACGGCCTGTT TACCGGATGC AGGAATCTTT TGCAGAAGTT TTGTTTTTAC
CTGGGCGCCG ATGTTACATT TTCCTGCCAT CTTCGCCGTT ATCTCCAACC CTATTACCGT
TCTCATCTTC CGGTTTTTCA GCCCGGTGTC GATCCGCAGA GAGATATTCT GCTGGTAAAC
GGTCGATTGT TGTGTGATGA GAAAGCGGCA ACCATCATAC GTGATCATCC CCCCGATCCC
GGCCAGTGTC TGATGCAGGG AGACGAACTT GTCCTCGCGA GAGTCAATGA GGCCCGTATT
GTGTCAGCTG ATAACATGCT TCCTGATTAT TTCGATACGC AACAGCTGGC AGCAGAAAGT
GAGACCGTCG TTGCGGAGGG ATTCAGGTTG CTGCGAAATA TCTGGGACCC GGTCGCTTTT
CATCCTGGAG AGCTGCATCG TGAAGCTTTT TCACTCGAGC TCGGAAGCAT CTCCGGCAGA
GTCTCGTCAC GCGCGGGCCT GGAGAATCCG GAAAGTATAT TTATCGGTGA GGGAGCTGTG
ATCAAGGCGG GAGCCCTGCT GGATGCTGAA GAGGGATTTG TGTATGTGAG TCCCGGCGCG
GTCGTGGAGC CTCAGGTGGT TCTTGCAGGA AACGTTTTCG CGGGTGAGTT CTCCTGTGTC
AGGACCGGAG CGAATCTGCA CAGCAATGTC TTTGTCGGCA GGGCGTCAAA AGCCGGGGGT
GAGATAGAGG ATGCCGTTAT AGAGCCCTAT GCGAACAAAC AGCATGAGGG TTTTCTCGGT
CACTCGTATA TCTCTTCGTG GTGTAATCTC GGGGCGGGAA CAAATACATC GGATTTGAGG
AACAACTACG GCAAAGTAAA GCTACAGGTT GAAAATAAGG AGTTTCGCAC CGGTGAGCAG
TTCCTCGGGC TTCTTATGGG AGAGCATACG AAGTGTTCTA TTAACTCGAT GTTCAATACC
GGTACCGTCG CAGGCGCTTC TTCAAATATT TTCGGTGGCG GATTTCCTCC TAAATATATA
CCTTCTTTTT CCTGGGGAGG GCCCGGATCG GGTTTTCAGC CCTATGAGAT AGAAAAAGCG
GTTGCAACCG CACGTGTTGT TATGGGCCGC CGAAATATCA GGATGTGCGA TGCCTACGAG
ACAATGTTCC GTTATGTCGC GGCTGTTGAA CAGGATAGTG GTACCGCTGT GTAG
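
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. A minimal sketch of that cleanup, together with a GC-percentage calculation of the kind that could be compared against the GC content field, is given here; the helper names are hypothetical, and the rounding used by the database is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-base blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)


# First printed line of the gene sequence, used only as sample input.
first_line = "ATGCAGATAG TCATATTCGA AGACGAGAAA GTTCACGGGT TTCACCCGCT GGTGCATTTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full listing to `clean_sequence` and then `gc_percent` gives a figure for the whole gene.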
 
Protein sequence
MQIVIFEDEK VHGFHPLVHF KPVYGLFTGC RNLLQKFCFY LGADVTFSCH LRRYLQPYYR 
SHLPVFQPGV DPQRDILLVN GRLLCDEKAA TIIRDHPPDP GQCLMQGDEL VLARVNEARI
VSADNMLPDY FDTQQLAAES ETVVAEGFRL LRNIWDPVAF HPGELHREAF SLELGSISGR
VSSRAGLENP ESIFIGEGAV IKAGALLDAE EGFVYVSPGA VVEPQVVLAG NVFAGEFSCV
RTGANLHSNV FVGRASKAGG EIEDAVIEPY ANKQHEGFLG HSYISSWCNL GAGTNTSDLR
NNYGKVKLQV ENKEFRTGEQ FLGLLMGEHT KCSINSMFNT GTVAGASSNI FGGGFPPKYI
PSFSWGGPGS GFQPYEIEKA VATARVVMGR RNIRMCDAYE TMFRYVAAVE QDSGTAV
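
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is a hand-rolled translator, not the tool used to produce this record. It relies on table 11 sharing its codon-to-amino-acid assignments with the standard code (the two differ in permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI layout); "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(text: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Whitespace from the blocked listing is ignored. Raises KeyError on
    ambiguous bases such as N.
    """
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGCAGATAG TCATATTCGA AGACGAGAAA"))  # prints: MQIVIFEDEK
```

The printed residues match the first block of the protein sequence, and the final codon of the gene (TAG) is a stop, which is why the protein has one residue fewer than the number of codons.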