Gene Information |
Locus tag | RSP_2047 |
Symbol | |
ID | 3719437 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | + |
Start bp | 642382 |
End bp | 644187 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640070211 |
Product | ThiF family protein |
Protein accession | YP_352099 |
Protein GI | 77462595 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAGTCGC CAAACGATGA GGAGATCCCG GATGTTCTGC ATCCGGTTAC CTCCTTGCTG CGTATTGGCG TTGGTCCCGT GACTGCGCTG GAAGGTTGGA AAGAGTGGCG AAGGGGGTTT TTCTCTCTTC CGCTTGTTGC TCGTGTCACA ATTTCGCCCG GCCAGAGCTT TCCTGCCGAA AGCAGATGGC ACCTTGTCGT GTCCTCTGGG TCCTACCCGG CAGATATCTT CATCTTGCCC GACAAGGTGG CAGGTCCGAA CTTGACCTTT CCCCACCAGG CCGCAGTCTA TTCTCGCGAT GGCAAAGAAC CGTGGTTGAA CGGTGAGCCC TGCCTGACAG ATCCGACGGC CGCGTTCGGT GATCGCCACG GATCCCGACC GGAACCCATC GCCCTTGCGG ACCGGCTGAT CTGGAAGGTC GAACGCTTTT CTCGCTGGTG CGAGCTAGCC GCCGCGGGTC GTCTCCATAA TCCCGGTGAT CACTTCGAAC TCCCGCCCCT TAGCGGACAT ACCAATCCGA TGACCATCGG GTTCCATGAG ACAGAGGGGG ATCTCGTCAG ATGGACCCAA GGCGCCGCTA GAGCAGGTAT TGTGCACTTG GTCAGTGTAT CTTCCTCTAG TAAAGTCCTT GCAGCGCAAA GTTGGCGGAC AACTGAGGAT GATCTCCTCC ATATGCCAGA TTGGGGTCTC CTGATAAAGA ATCAGTCGCC GGGACAGATA CCAGCGATAT GGTTCTTGCT CGAAGAGATC CCCATCATCG GTGCTTGGCA GTTGCCCCGG ACATATCGGG AGCTCAGCGA GAGACTTGCA GGAGAAACGA TTGACCTAGC CAAATCTTTA GCGGTGTTGG GCCAAAGCAT GAGGAAAAGG AACATTAGCC GACCGGCCGT AATCATGTTC GGCTTTCCTA TTCCACGGTT CTTTGGAGAT GCCCCTTCGC GCATTCACTG GCTCGCAATA TGTGACGTCC GCTTGACCCG AAAAAACAGC GCAAGGAATG GTTTCCGCTC CTCCGAGGCG AACCGACAAC GGTTTGATCG TGACCTCGCA CGGTCAGCCA AAGCCCTTTC TTGGGCTACA TGTGAAAACT GGGCACCTGA TCAGATAAGG ACGCGCCTGC GTAGCGCGCC GGGCTCGCAG CCGAAGATGC TGCTGATCGG TGCTGGTGCG CTTGGCAGCC AAGTTGCCGA GACGCTCATG CGTACGGGTG TGCGTGACAT TGATGTGCGT GACCGAGATG AACTCGCGGC GGGAAACCTC TGCCGACACG CTCTCGATCT CACCATGATC GCTGTGAACA AGGCCGCAGC CCTAGCCGCG CAACTCAATC GCCTTCAACC TGACGGAAGA GCCCGAGGCG ATGATCGGGC CTTCCCAACT GTCGGACCGG ACGGGCCTGA GAATGCCGAT GACTATGATG TCATCCTCGA TTGCACCGGC GAGAACAAGC TGTTGCGCAG CCTTTCGATG ATACCCTGGA AGTCCGAAAA ACTGTTCATC TCCCTCTCTG TAAACTGGGG CGCGCGGGGT CTCATGTTCT GGTCATCTCG GGGCGCATCA TTTCCGGCAG TTGATGCAAC GGAACGTCTC GAGGCGCTTG CAAGAAAGTT CAGGCCCGAG GGAATCGATG AGCGGTTTGA GGGGATCGGC TGCTGGCATC CGGTCTTCCC AGCTGATGCA GCAGACATCC GAATCTGGGC CGGAATGGGG GCAAGATATG TGCTGGACCA GATCGAGGGC GGCCCAGAGG GGTGCGGCAT CTATTTCTTC AACGACGATC GGACCATCGG ACTTGAGCAT 
GGATAG
|
Protein sequence | MKSPNDEEIP DVLHPVTSLL RIGVGPVTAL EGWKEWRRGF FSLPLVARVT ISPGQSFPAE SRWHLVVSSG SYPADIFILP DKVAGPNLTF PHQAAVYSRD GKEPWLNGEP CLTDPTAAFG DRHGSRPEPI ALADRLIWKV ERFSRWCELA AAGRLHNPGD HFELPPLSGH TNPMTIGFHE TEGDLVRWTQ GAARAGIVHL VSVSSSSKVL AAQSWRTTED DLLHMPDWGL LIKNQSPGQI PAIWFLLEEI PIIGAWQLPR TYRELSERLA GETIDLAKSL AVLGQSMRKR NISRPAVIMF GFPIPRFFGD APSRIHWLAI CDVRLTRKNS ARNGFRSSEA NRQRFDRDLA RSAKALSWAT CENWAPDQIR TRLRSAPGSQ PKMLLIGAGA LGSQVAETLM RTGVRDIDVR DRDELAAGNL CRHALDLTMI AVNKAAALAA QLNRLQPDGR ARGDDRAFPT VGPDGPENAD DYDVILDCTG ENKLLRSLSM IPWKSEKLFI SLSVNWGARG LMFWSSRGAS FPAVDATERL EALARKFRPE GIDERFEGIG CWHPVFPADA ADIRIWAGMG ARYVLDQIEG GPEGCGIYFF NDDRTIGLEH G
|
| |
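The length fields in the record above are linked by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene ends in a single stop codon that is not counted in the protein length. The function names (`span_length`, `coding_codons`, `gc_percent`) are hypothetical helpers introduced for illustration and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 642382
END_BP = 644187
GENE_LENGTH_BP = 1806
PROTEIN_LENGTH_AA = 601


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Number of translated codons, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; spaces and line breaks are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Coordinates agree with the stated gene length.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)       # True
    # Gene length agrees with the stated protein length.
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Passing the full Gene sequence string to `gc_percent` gives a value that can be compared with the GC content field. The function skips the spaces between 10-base blocks, so the sequence can be pasted as shown in the record.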