Gene TM1040_0462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0462 
Symbol 
ID4078344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp480351 
End bp481427 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content62% 
IMG OID638005758 
ProductUBA/THIF-type NAD/FAD binding fold 
Protein accessionYP_612457 
Protein GI99080303 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.987597 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTTGG TACTGGGGCT TGCGGGGCTG ATCTGGTGGG GCGGGCGGCT CTTTGGGGCA 
TCGCAGTCGA TGCGCTGGGC GTTGCTTGGT CTTTTGTTCC TGGCGGTTCT CAGCATCCAG
CTGACCTTGC CCGAAGGCAA TCCATTGCGT GCCGCCACTG GGGGATCTGC GAGCCTGTGG
CTCCTGATTG CCGGGACAGG GGCCTTGGTT GCGCTCTATG GCCGCGTGTT GAAGCGGCTG
CGCAACCGTG CGGATGCGCG CGAGGACAGT GCTGATGCGC CTGAGGCGGC TGCTCCGTTT
CGGGACTCGG AACTGGACCG CTATGCCCGC CACATCGTCC TGCGCGAGGT CGGAGGCGCT
GGGCAAAAGC GCCTGAAAGA CGCCAGAGTT CTGGTGATCG GAGCGGGTGG CCTGGGGGCT
CCCGCGCTGC AGTATCTGGC CGCTGCGGGC GTCGGCACCA TCGGCGTTAT TGACGATGAT
CGCGTCGAGA ACGCCAATCT GCAACGACAG GTTATCCATC GTGACGCCGA TATTGGCATG
CCCAAGGTCT TCTCGGCTCA GGCCGCGATG GAGGCGCAAA ACCCTTTTGT GACCGTCCGT
CCCTATCATC GTCGCCTCAG CGAGGATATC GCATCGGAAC TCTTTGCGGA ATATGACCTG
ATCCTCGACG GGACTGACAA TTTCGACACT CGCTATCTGG CCAATGCGGC GGCGGTTGCG
CAGAGGAAAC CCCTGATTTC CGGCGCCTTG TCGCAGTGGG AAGGCCAGAT CTCCGTTTTT
GATCCCGCCT CGGGCGGCCC GTGTTACCAG TGTATTTTTC CCGAAAGCCC CGCCGCTGGC
CTTGCGCCCA GCTGTGCAGA GGCGGGTGTA ATTGGACCTT TGCCCGGCGT TTTGGGCGCG
ATGATGGCTG TGGAGGCCGT GAAACAGATC ACAGGCGCAG GAGAAGTCTT GCGCGCGCAG
ATGTTGATCT ACGATGGTCT CTATGGTGAA ACCCGCCGGA TTGCGTTGAA AGCGCGCGCC
GATTGTCCGA TTTGCGGACC CAATGCAGGC ACCCCGGCCC CGACAGGAGA AAACTGA
 
Protein sequence
MILVLGLAGL IWWGGRLFGA SQSMRWALLG LLFLAVLSIQ LTLPEGNPLR AATGGSASLW 
LLIAGTGALV ALYGRVLKRL RNRADAREDS ADAPEAAAPF RDSELDRYAR HIVLREVGGA
GQKRLKDARV LVIGAGGLGA PALQYLAAAG VGTIGVIDDD RVENANLQRQ VIHRDADIGM
PKVFSAQAAM EAQNPFVTVR PYHRRLSEDI ASELFAEYDL ILDGTDNFDT RYLANAAAVA
QRKPLISGAL SQWEGQISVF DPASGGPCYQ CIFPESPAAG LAPSCAEAGV IGPLPGVLGA
MMAVEAVKQI TGAGEVLRAQ MLIYDGLYGE TRRIALKARA DCPICGPNAG TPAPTGEN