Gene TM1040_1937 details


Gene Information

Locus tag: TM1040_1937
Symbol:
ID: 4076888
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2040105
End bp: 2041208
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 59%
IMG OID: 638007253
Product: putative GTP cyclohydrolase
Protein accession: YP_613932
Protein GI: 99081778
COG category: [S] Function unknown
COG ID: [COG1469] Uncharacterized conserved protein
TIGRFAM ID:
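
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields.
start_bp = 2040105
end_bp = 2041208

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1104
print(protein_length)  # 367
```

Both results match the reported Gene Length (1104 bp) and Protein Length (367 aa).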


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.166186
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATATTC ATTCTCGTGA TGTAAACGAA ACGCCAGATC GCTCGGACGC GGAACAAGCG 
CTTGCGGTGC TGCGGCGCTG GGCGGGCGAA GCAAGCGAGA CCGAAGTGGC GCAGCTCGAC
CCTGCGATTG CGCGCCTGCT GCCCGGGCAG GAATTGCAGA ATTACCCCGA CCTCAAGCGC
CAGTACCCGG ACGACTTTGA TGCAAACGAG TCCTACCGCG CCACGCTCCC GGATCTTCAG
AACGGCCCTT CCAGCCTGAT CCGCGGCGCC AAGGAGCAGA TCCAGCATGT CGGTATCTCC
AATTTCCGCC TGCCGATCCG GTTTCATACG CGGGACAACG GTGATCTGAC GCTCGAGACC
AGCGTAACCG GCACCGTCAG CTTGGATGCG GAGAAAAAAG GCATCAACAT GTCGCGCATC
ATGCGCAGCT TTTACAAACA TGCTGAGAAG GTCTTTTCTT TCGACGTGAT GGAAGCAGCT
CTTGAGGATT ATCTGAGCGA TCTTGAGAGT GGCGACGCGC GGCTGCAGAT GCGGTTTTCC
TTCCCTGTGA AGGTGCAGAG CCTGCGCTCG GGTCTTTCGG GCTATCAGTA TTACGACGTG
GCGCTGGAAC TGGTGCAGAT GGCTGGGCAA CGCCATCGCA TCGTGCATCT GGACTATGTC
TATTCTTCGA CCTGCCCGTG CTCGCTTGAG CTTTCGGAAC ATGCCCGTCA GGCGCGTGGA
CAGCTGGCGA CCCCGCACTC GCAGCGCTCT GTTGCGCGGA TTTCCGTGCA GATGGAACAG
GATGGCGGCT GTTTGTGGTT CGAGGATCTG ATCGACCATT GCCGTCGCGC GGTGCCGACC
GAGACGCAGG TGATGGTGAA GCGCGAAGAC GAACAGGCAT TTGCGGAGTT GAACGCGGCC
AATCCGATCT TTGTGGAAGA TGCGGCGCGC CTCTTTTGTG AAGCGCTTCA GAGCGATGCG
CGGGTGGGGG ATTTTCGGGT TGTGGCGAGC CATCAGGAAA GCCTGCACAG CCATGACGCG
GTCTCTGTGC TGACGCAGGG CACGATGTTT GCGGCGCCGA GCCTCGACCC GCAGCTATTC
TCGACTTTGA TCCATCGCGG TTAG
 
Protein sequence
MNIHSRDVNE TPDRSDAEQA LAVLRRWAGE ASETEVAQLD PAIARLLPGQ ELQNYPDLKR 
QYPDDFDANE SYRATLPDLQ NGPSSLIRGA KEQIQHVGIS NFRLPIRFHT RDNGDLTLET
SVTGTVSLDA EKKGINMSRI MRSFYKHAEK VFSFDVMEAA LEDYLSDLES GDARLQMRFS
FPVKVQSLRS GLSGYQYYDV ALELVQMAGQ RHRIVHLDYV YSSTCPCSLE LSEHARQARG
QLATPHSQRS VARISVQMEQ DGGCLWFEDL IDHCRRAVPT ETQVMVKRED EQAFAELNAA
NPIFVEDAAR LFCEALQSDA RVGDFRVVAS HQESLHSHDA VSVLTQGTMF AAPSLDPQLF
STLIHRG
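
As a rough illustration of how the gene and protein sequences relate, the sketch below translates short excerpts of the gene sequence and defines a GC-fraction helper. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of the source database. The codon table is the standard one, which is assumed to be sufficient here because translation table 11 uses the same amino acid assignments and differs only in its permitted start codons.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 shares these amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the display and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 24 bases of the gene sequence shown above.
print(translate("ATGAATATTC ATTCTCGTGA TGTAAACGAA"))  # MNIHSRDVNE
print(translate("TCGACTTTGA TCCATCGCGG TTAG"))        # STLIHRG
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and the reported GC content.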