Gene TM1040_1989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1989 
Symbol 
ID4077173 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2092984 
End bp2094432 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content65% 
IMG OID638007304 
Productcobyric acid synthase 
Protein accessionYP_613983 
Protein GI99081829 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1492] Cobyric acid synthase 
TIGRFAM ID[TIGR00313] cobyric acid synthase CobQ 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.56437 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCAAG GCACCGGCAG CAATGTCGGA AAATCCATGC TGGTGGCAGG GCTAGCACGG 
GCCTTGCGCA AACGCGGCCT CTCGGTGGCG CCCTTCAAAC CGCAGAACAT GTCCAACAAC
GCCGCCGTCA CCTCGGACGG GGGGGAGATC GGCCGCGCCC AGGCCCTGCA GGCCCGCGCG
GCGGGGCTTG CGCCGCATAC GGATATGAAC CCGGTGCTCC TGAAGCCCGA AACCGATACC
GGCGCGCAGG TCATCGTGCA GGGCAAGCGG CGCGGCACCC GCGCGGCGGG GTCGTTTATG
CGCGACAAGG CGGGCCTTCT GGAAGCCACG CTCGAGAGCT TTCACCGCCT CGCAGCGCAG
CATGACATTG TCCTCATCGA GGGCGCAGGC TCTCCGGCAG AAACCAATCT GCGCAAGGGC
GACATCGCCA ATATGGGCTT TGCCGAAGCT GCAGGCGTAC CTGTCTTGCT GGTGGGCGAC
ATCCATCGGG GCGGGGTGAT CGCGCAGATC GTTGGCACCC ATACGGTGTT GGAGCCAAGC
GACCGCGCGC GGATCAAAGC CTTCGCCGTC AATCGCTTCC GGGGCGACCT TAGCCTTTTT
GATGGCGGGC GGGATGACAT TGCGCGCTGG ACGGGCTGGC CTTCGCTGGG GGTGGTGCCA
TGGTTCTGGG ATGCGTGGAA ACTGCCGGCC GAGGATATGA TGGACATCGC CTCCCACAAG
GGCGGCGCTT GCAAGGTGGT GGTGCCGCAG CTTGAACGCA TGGCGAATTT CGACGACCTC
GACCCGCTTG CAGCAGAACC TGCGGTGACG GTCGAGATCG TGCCCCCCGG GCGCGCCCTG
CCCGGTGATG CGGATCTGGT GCTGATCCCC GGCTCCAAAT CCACTATCGG CGATCTGGCC
TATCTGCGCA CGCAGGGCTG GGACATCGAC ATCCTCGCTC ATCACCGGCG CGGCGGACAT
GTGCTCGGGC TTTGTGGCGG CTATCAGATG CTCGGCCAGA GTATCGACGA TCCCGAAGGT
GTCGATGGCC ATCCCGGCAA AGTCGCGGGG CTTGGCCTCT TGGATGTCCA CACTGTTATG
GCCGGAGACA AGCGCGTCAC CCTGAGCGCG GCGCGCACAC TCGAGGGGGA TCTGCCTGTT
TCTGGCTATG AGATCCACAT GGGCCGCACC ACGGGGCCGG ATTGCGCGCG GGCCTGGCTC
GCGCTCGAAG GCCGCGCGGA GGGGGCGACC TCTGCCGATG GGCGTGTGCG CGGCTCTTAT
CTGCACGGGC TTTTTACATC GGACGCGTTT CGGGCACAGT TCCTCTCCGA CCTCGGACAC
CAGTCCGATC TGGACTATGA CGCCGGGGTC GAGGCGACGC TTGATGAGCT TGCAGCCCAT
CTTGAACAAT ATATGGATGT GGAAGGCCTG CTCGAACTGG CCGAACCCAT TCCTGTGCCT
GAATCCTGA
 
Protein sequence
MIQGTGSNVG KSMLVAGLAR ALRKRGLSVA PFKPQNMSNN AAVTSDGGEI GRAQALQARA 
AGLAPHTDMN PVLLKPETDT GAQVIVQGKR RGTRAAGSFM RDKAGLLEAT LESFHRLAAQ
HDIVLIEGAG SPAETNLRKG DIANMGFAEA AGVPVLLVGD IHRGGVIAQI VGTHTVLEPS
DRARIKAFAV NRFRGDLSLF DGGRDDIARW TGWPSLGVVP WFWDAWKLPA EDMMDIASHK
GGACKVVVPQ LERMANFDDL DPLAAEPAVT VEIVPPGRAL PGDADLVLIP GSKSTIGDLA
YLRTQGWDID ILAHHRRGGH VLGLCGGYQM LGQSIDDPEG VDGHPGKVAG LGLLDVHTVM
AGDKRVTLSA ARTLEGDLPV SGYEIHMGRT TGPDCARAWL ALEGRAEGAT SADGRVRGSY
LHGLFTSDAF RAQFLSDLGH QSDLDYDAGV EATLDELAAH LEQYMDVEGL LELAEPIPVP
ES