Gene Information |
Locus tag | TM1040_0462 |
Symbol | |
ID | 4078344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 480351 |
End bp | 481427 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638005758 |
Product | UBA/THIF-type NAD/FAD binding fold |
Protein accession | YP_612457 |
Protein GI | 99080303 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | |
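
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(480351, 481427)
print(length, protein_length(length))  # 1077 358
```

Under these assumptions, the 1077 bp span gives 359 codons, of which 358 encode residues.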
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.987597 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTTTGG TACTGGGGCT TGCGGGGCTG ATCTGGTGGG GCGGGCGGCT CTTTGGGGCA TCGCAGTCGA TGCGCTGGGC GTTGCTTGGT CTTTTGTTCC TGGCGGTTCT CAGCATCCAG CTGACCTTGC CCGAAGGCAA TCCATTGCGT GCCGCCACTG GGGGATCTGC GAGCCTGTGG CTCCTGATTG CCGGGACAGG GGCCTTGGTT GCGCTCTATG GCCGCGTGTT GAAGCGGCTG CGCAACCGTG CGGATGCGCG CGAGGACAGT GCTGATGCGC CTGAGGCGGC TGCTCCGTTT CGGGACTCGG AACTGGACCG CTATGCCCGC CACATCGTCC TGCGCGAGGT CGGAGGCGCT GGGCAAAAGC GCCTGAAAGA CGCCAGAGTT CTGGTGATCG GAGCGGGTGG CCTGGGGGCT CCCGCGCTGC AGTATCTGGC CGCTGCGGGC GTCGGCACCA TCGGCGTTAT TGACGATGAT CGCGTCGAGA ACGCCAATCT GCAACGACAG GTTATCCATC GTGACGCCGA TATTGGCATG CCCAAGGTCT TCTCGGCTCA GGCCGCGATG GAGGCGCAAA ACCCTTTTGT GACCGTCCGT CCCTATCATC GTCGCCTCAG CGAGGATATC GCATCGGAAC TCTTTGCGGA ATATGACCTG ATCCTCGACG GGACTGACAA TTTCGACACT CGCTATCTGG CCAATGCGGC GGCGGTTGCG CAGAGGAAAC CCCTGATTTC CGGCGCCTTG TCGCAGTGGG AAGGCCAGAT CTCCGTTTTT GATCCCGCCT CGGGCGGCCC GTGTTACCAG TGTATTTTTC CCGAAAGCCC CGCCGCTGGC CTTGCGCCCA GCTGTGCAGA GGCGGGTGTA ATTGGACCTT TGCCCGGCGT TTTGGGCGCG ATGATGGCTG TGGAGGCCGT GAAACAGATC ACAGGCGCAG GAGAAGTCTT GCGCGCGCAG ATGTTGATCT ACGATGGTCT CTATGGTGAA ACCCGCCGGA TTGCGTTGAA AGCGCGCGCC GATTGTCCGA TTTGCGGACC CAATGCAGGC ACCCCGGCCC CGACAGGAGA AAACTGA
|
Protein sequence | MILVLGLAGL IWWGGRLFGA SQSMRWALLG LLFLAVLSIQ LTLPEGNPLR AATGGSASLW LLIAGTGALV ALYGRVLKRL RNRADAREDS ADAPEAAAPF RDSELDRYAR HIVLREVGGA GQKRLKDARV LVIGAGGLGA PALQYLAAAG VGTIGVIDDD RVENANLQRQ VIHRDADIGM PKVFSAQAAM EAQNPFVTVR PYHRRLSEDI ASELFAEYDL ILDGTDNFDT RYLANAAAVA QRKPLISGAL SQWEGQISVF DPASGGPCYQ CIFPESPAAG LAPSCAEAGV IGPLPGVLGA MMAVEAVKQI TGAGEVLRAQ MLIYDGLYGE TRRIALKARA DCPICGPNAG TPAPTGEN
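
The relationship between the gene and protein sequences can be spot-checked with a minimal translation sketch. It builds the codon map from the standard NCBI ordering of amino acids over the bases T, C, A, G. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First five codons and last nine codons of the gene sequence above.
print(translate("ATGATTTTGG TACTG"))               # MILVL
print(translate("ACCCCGGCCC CGACAGGAGA AAACTGA"))  # TPAPTGEN*
```

Both fragments agree with the start (MILVL) and end (TPAPTGEN) of the listed protein sequence, and the trailing `*` corresponds to the TGA stop codon.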