Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3654 |
Symbol | |
ID | 4075623 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 708264 |
End bp | 709220 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638005174 |
Product | UBA/THIF-type NAD/FAD binding fold |
Protein accession | YP_611883 |
Protein GI | 99078625 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGCT ATGCTCGCCA GATGATGCTG CCCGAGGTGG GGGCTGTAGG GCAGGCGCGT CTCAGCACTG CGCGCGTTCT GGTTGTAGGG GCAGGGGGTC TCGCTGCGCC GGTTCTGCCG CTCTTAGCGG GGGCCGGGAT CGGACATATC ACCCTGATCG ACGGCGATGT TGTGAGCCTG TCCAATCTGC ATCGCCAGAC CTTGTTTCAA GAAACCGACT GTGGCCGTCC CAAAGCCGAA GTCGCCGCGC AGCGCTGCAG CGCCCTCAAC AGTGAAATTG AGATCGTCGC GGTTGCACAT GCGCTCACTC CAGCCAATGC GCCGCTCGTT CTTGCAGATG TGGATCTCGT GCTCGATTGT GCGGACAGCT ATGCCGTGAG CTACCTCCTG AGCGATCTCT GCCATGCCCA GAAGACTCCG CTTATCAGCG CTTCGGTGCT GGGGTCAGGC GGATATGTTG GCGGTTTTTG CGGTGGGGCG CCTTCCTTGC GGGCGGTGTT CCCCGATGCC CCCGACAACA GTGCCAGCTG TGAGACGGCA GGTGTCTATG GCCCTGTGGT TGGAATGATT GGCGCGTTGC AGGCTCAGAT GGCGCTCAAT ATTCTCTTGG AACATGTGCC CTCGCCTCTG GGCCAAATGG TACAGCTGGA TTGTCGCAGC TATCGCTCGA CGACCTTTCG TTTCGACCAT GCGCCCGAAC CCGAGGTGAG CTTTCCGTTT GTAGCCATCG AAGAGCTACA GGCAGATGAT CACATCATCG AGTTGCGCGC AGATGCGCCA CTGCTTCACC CAAAGGCCAG GCGATCGGAC GCAGAGATGC TGTTGCAGAC CCTGCCCAAT CCCCAAAAAC GTTTGGTGCT GTGCTGCGCC ACGGGTCTGC GGGCCTGGCG CACGGCAGAA AGAATTCACC CCATCTGGCC GGGCGAGATC GTTCTCGTCG CGGCCTCCGC ATCCTAA
|
Protein sequence | MSRYARQMML PEVGAVGQAR LSTARVLVVG AGGLAAPVLP LLAGAGIGHI TLIDGDVVSL SNLHRQTLFQ ETDCGRPKAE VAAQRCSALN SEIEIVAVAH ALTPANAPLV LADVDLVLDC ADSYAVSYLL SDLCHAQKTP LISASVLGSG GYVGGFCGGA PSLRAVFPDA PDNSASCETA GVYGPVVGMI GALQAQMALN ILLEHVPSPL GQMVQLDCRS YRSTTFRFDH APEPEVSFPF VAIEELQADD HIIELRADAP LLHPKARRSD AEMLLQTLPN PQKRLVLCCA TGLRAWRTAE RIHPIWPGEI VLVAASAS
|
| |