Gene TM1040_0050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0050 
Symbol 
ID4078713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp52702 
End bp53790 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content61% 
IMG OID638005337 
ProductGTP cyclohydrolase II 
Protein accessionYP_612045 
Protein GI99079891 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0807] GTP cyclohydrolase II 
TIGRFAM ID[TIGR00505] GTP cyclohydrolase II 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.331797 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTAA TGCCGACAAT GCTGGAGCGG ATCGCCCGCG CCCGAGTGGA TCTGAGGATG 
GGACTGCCGG TGATCTTGAC CACCGGTGAG AGCGCCATTC TTGCCATTTC CGTCGAGAGT
CTCAGCCCTG AGCGGCTTAC GGATCTGCGC AGCCTCGGCC CGGTGACTCT GGCGCTGACT
GCACGGCGGG CGGCCACGCT GAAGGCGCGC GTTTATGACA ATGATATTGC GCGCGTCACG
GTGCCCAGCG ACGCAGGCCT GGCCTGGGTT CAGGCGCTGG CGGACCCTGC GGATGATCTC
GAGACCCCGA TGAAAGGCCC CCTATTGAGT GAGCGCGAAG GCGATGCGTC CCTGCCGCGC
CTTGCAATAG CAATGGTAAA ATCCGCCCGT CTTCTGCCTG CGACCGCCTA CGTCGCGCTT
GAGAATGCTG GTGACTTTGC GCTGAAACAC GACCTGACCC TGCTGCCGCA GTCCGCAGCC
GAGCCTCTGC TGAATACCAG CTCGCCCCTC CACCCGGTTG CGGCCGCACG CCTGCCGATG
GAGGCCTCCG AGGCAGGTCG CTTGCATATC TTTCGCCCAG AGGATGGCGG CGAGGAGCAC
TATGCGATCG AGATCGGTCG CCCGGATCGG AGCCAGCCAG TCTTGGCGCG GTTACACTCA
GCCTGTTTCA CCGGTGATGT GTTGGGGTCA TTGAAATGTG ACTGCGGACC GCAGCTGCGC
GGAGCATTGA GTCACATGGG CCAAGAGGGC GCGGGCATCC TGCTGTATCT CAACCAAGAG
GGCCGCGGCA TCGGGCTGGC CAATAAGATG CGCGCCTATT CCCTTCAAGA TCAGGGATTT
GACACAGTCG AGGCCAATCA TCGCCTTGGC TTTGAGGATG ATGAGCGCGA TTTTCGCCTT
GGCGCGTCGA TTTTGCGCGA ACTTGGGTTT TCTTCCGTGC GCCTCATGAC CAACAATCCC
GGCAAGATCG CCATGATGGA GAAAACCGGG ATTTCCGTTG TCGAACGCGT ACCGCTCAAG
GTCGGTGAGA ACGCGTTTAA CCGTCACTAT CTCGCGACCA AGGCTGCAAA ATCAGGCCAC
ATGCTATGA
 
Protein sequence
MSLMPTMLER IARARVDLRM GLPVILTTGE SAILAISVES LSPERLTDLR SLGPVTLALT 
ARRAATLKAR VYDNDIARVT VPSDAGLAWV QALADPADDL ETPMKGPLLS EREGDASLPR
LAIAMVKSAR LLPATAYVAL ENAGDFALKH DLTLLPQSAA EPLLNTSSPL HPVAAARLPM
EASEAGRLHI FRPEDGGEEH YAIEIGRPDR SQPVLARLHS ACFTGDVLGS LKCDCGPQLR
GALSHMGQEG AGILLYLNQE GRGIGLANKM RAYSLQDQGF DTVEANHRLG FEDDERDFRL
GASILRELGF SSVRLMTNNP GKIAMMEKTG ISVVERVPLK VGENAFNRHY LATKAAKSGH
ML