Gene CPS_4049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPS_4049 
SymbolcofG 
ID3522494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameColwellia psychrerythraea 34H 
KingdomBacteria 
Replicon accessionNC_003910 
Strand
Start bp4258459 
End bp4259637 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content47% 
IMG OID637286494 
ProductFO synthase subunit 1 
Protein accessionYP_270706 
Protein GI71282022 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR03550] 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofG subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAAGT TAACAGCAGT ACAGGCCAGC CAACTAGGCG ATGTTCGCGG CAAAGCGTTG 
AATACGCTCC GACAAGCCGC CTGTGCTGTG CGTGATCAAC ATTGGGGCAA GATCATGACG
TATTCAAGAA AGGTTTTTAT TCCTCTGACC AACATGTGCC GCGATGAGTG CCAATACTGC
ACCTTCGTTC AGCGTCCAGA ATCCGGTAAT GCAACCATCA TGACGCCGGA GCAGGTATTG
ACCGTCATTC GTCAGGGTCA GGCAATGGGC TGCAAAGAAG TGTTGTTGAG CCTAGGTGAA
AAACCTGAAC TACGCTATCG CGAAGCAAGA GAAGCACTCG CTGAACAAGG TTTTTCCACG
ATGATGGAAT ACGTAGCTGA GATCAGTGCA CTAATACTGC GTGAAACCAG CCTGTTACCA
CACGTTAATG CCGGTACTAT GACCGCTGAT GAGTTGGCAA ATATTAAAAA AGTCAGCGCT
AGCATGGGCA TGATGCTTGA AACTGTCAGT GAACGATTAT TGCAAAAAGG ACAAGCGCAC
TACGCCTGTC CAGACAAAGT TCCCGCTACA CGTTTGGCAA CCATCAAAAG TGCGGGCGAG
CAGAACATTC CTTATACCAC TGGTATTTTG ATAGGTATTG GTGAAACGTG GCAGGAACGT
GTAGAAAGTC TTGAAGCCAT TAATAATCTT CACCTCAAAT ATGGTCACAT TCAGGAAGTT
ATCGTGCAGA ATTTTTGTGC CAAATCCGGC ACGGCCATGG CTGACCATCC AGAGCCTGAT
CTTGAAGATA TGTTGCGGAC TTTGGCCGTC GCTCGTCTAA TGCTCGATCC GAGTATTAGT
ATTCAGGCAC CACCAAATCT ACAACAACGC TATAAAGATT ATATCGGCAG CGGTATTAAT
GATTGGGGTG GAATTTCGCC GTTAACCAAG GACTTTATCA ACCCAGAAAG AGCCTGGCCT
CAAATTGAGC AATTAGCCAA GGCGACTCAG GACTGTGGTT ATCAGTTACA AGAACGCCTA
GCTGTTTACC CTGAATATCT TAAACAACAA TATCTTAGCC CACAAATCTC AAAGCGACTC
GAAGGTATGG CTCGCGCCGA CGGTTTAGCC TCTCAACAAT GTGTTACTGC AGAATCGGCA
AAGCATGCCG CTGATATGAT CTACCACGTG GCCCTTTAA
 
Protein sequence
MDKLTAVQAS QLGDVRGKAL NTLRQAACAV RDQHWGKIMT YSRKVFIPLT NMCRDECQYC 
TFVQRPESGN ATIMTPEQVL TVIRQGQAMG CKEVLLSLGE KPELRYREAR EALAEQGFST
MMEYVAEISA LILRETSLLP HVNAGTMTAD ELANIKKVSA SMGMMLETVS ERLLQKGQAH
YACPDKVPAT RLATIKSAGE QNIPYTTGIL IGIGETWQER VESLEAINNL HLKYGHIQEV
IVQNFCAKSG TAMADHPEPD LEDMLRTLAV ARLMLDPSIS IQAPPNLQQR YKDYIGSGIN
DWGGISPLTK DFINPERAWP QIEQLAKATQ DCGYQLQERL AVYPEYLKQQ YLSPQISKRL
EGMARADGLA SQQCVTAESA KHAADMIYHV AL