Gene RSP_2047 details

Gene Information

Locus tag: RSP_2047
Symbol:
ID: 3719437
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_007493
Strand:
Start bp: 642382
End bp: 644187
Gene Length: 1806 bp
Protein Length: 601 aa
Translation table: 11
GC content: 57%
IMG OID: 640070211
Product: ThiF family protein
Protein accession: YP_352099
Protein GI: 77462595
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
TIGRFAM ID:
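
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon. The function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one trailing stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the stop codon contributes one residue.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Coordinates copied from the record above.
print(cds_lengths(642382, 644187))  # (1806, 601)
```

The result matches the reported gene length of 1806 bp and protein length of 601 aa.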


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAGTCGC CAAACGATGA GGAGATCCCG GATGTTCTGC ATCCGGTTAC CTCCTTGCTG 
CGTATTGGCG TTGGTCCCGT GACTGCGCTG GAAGGTTGGA AAGAGTGGCG AAGGGGGTTT
TTCTCTCTTC CGCTTGTTGC TCGTGTCACA ATTTCGCCCG GCCAGAGCTT TCCTGCCGAA
AGCAGATGGC ACCTTGTCGT GTCCTCTGGG TCCTACCCGG CAGATATCTT CATCTTGCCC
GACAAGGTGG CAGGTCCGAA CTTGACCTTT CCCCACCAGG CCGCAGTCTA TTCTCGCGAT
GGCAAAGAAC CGTGGTTGAA CGGTGAGCCC TGCCTGACAG ATCCGACGGC CGCGTTCGGT
GATCGCCACG GATCCCGACC GGAACCCATC GCCCTTGCGG ACCGGCTGAT CTGGAAGGTC
GAACGCTTTT CTCGCTGGTG CGAGCTAGCC GCCGCGGGTC GTCTCCATAA TCCCGGTGAT
CACTTCGAAC TCCCGCCCCT TAGCGGACAT ACCAATCCGA TGACCATCGG GTTCCATGAG
ACAGAGGGGG ATCTCGTCAG ATGGACCCAA GGCGCCGCTA GAGCAGGTAT TGTGCACTTG
GTCAGTGTAT CTTCCTCTAG TAAAGTCCTT GCAGCGCAAA GTTGGCGGAC AACTGAGGAT
GATCTCCTCC ATATGCCAGA TTGGGGTCTC CTGATAAAGA ATCAGTCGCC GGGACAGATA
CCAGCGATAT GGTTCTTGCT CGAAGAGATC CCCATCATCG GTGCTTGGCA GTTGCCCCGG
ACATATCGGG AGCTCAGCGA GAGACTTGCA GGAGAAACGA TTGACCTAGC CAAATCTTTA
GCGGTGTTGG GCCAAAGCAT GAGGAAAAGG AACATTAGCC GACCGGCCGT AATCATGTTC
GGCTTTCCTA TTCCACGGTT CTTTGGAGAT GCCCCTTCGC GCATTCACTG GCTCGCAATA
TGTGACGTCC GCTTGACCCG AAAAAACAGC GCAAGGAATG GTTTCCGCTC CTCCGAGGCG
AACCGACAAC GGTTTGATCG TGACCTCGCA CGGTCAGCCA AAGCCCTTTC TTGGGCTACA
TGTGAAAACT GGGCACCTGA TCAGATAAGG ACGCGCCTGC GTAGCGCGCC GGGCTCGCAG
CCGAAGATGC TGCTGATCGG TGCTGGTGCG CTTGGCAGCC AAGTTGCCGA GACGCTCATG
CGTACGGGTG TGCGTGACAT TGATGTGCGT GACCGAGATG AACTCGCGGC GGGAAACCTC
TGCCGACACG CTCTCGATCT CACCATGATC GCTGTGAACA AGGCCGCAGC CCTAGCCGCG
CAACTCAATC GCCTTCAACC TGACGGAAGA GCCCGAGGCG ATGATCGGGC CTTCCCAACT
GTCGGACCGG ACGGGCCTGA GAATGCCGAT GACTATGATG TCATCCTCGA TTGCACCGGC
GAGAACAAGC TGTTGCGCAG CCTTTCGATG ATACCCTGGA AGTCCGAAAA ACTGTTCATC
TCCCTCTCTG TAAACTGGGG CGCGCGGGGT CTCATGTTCT GGTCATCTCG GGGCGCATCA
TTTCCGGCAG TTGATGCAAC GGAACGTCTC GAGGCGCTTG CAAGAAAGTT CAGGCCCGAG
GGAATCGATG AGCGGTTTGA GGGGATCGGC TGCTGGCATC CGGTCTTCCC AGCTGATGCA
GCAGACATCC GAATCTGGGC CGGAATGGGG GCAAGATATG TGCTGGACCA GATCGAGGGC
GGCCCAGAGG GGTGCGGCAT CTATTTCTTC AACGACGATC GGACCATCGG ACTTGAGCAT
GGATAG
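
The gene sequence is displayed in blocks of 10 bases, so it has to be joined before any analysis. The following is a minimal sketch of how the blocks could be joined and the GC fraction computed for comparison with the GC content field in this record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sketch assumes the sequence contains only A, C, G, and T.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases in a cleaned sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# Usage with the first two blocks of the gene sequence above.
first_blocks = clean_sequence("TTGAAGTCGC CAAACGATGA")
print(len(first_blocks), round(gc_fraction(first_blocks), 2))
```

Pasting the full gene sequence into `clean_sequence` would give a single string whose length can be checked against the Gene Length field.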
 
Protein sequence
MKSPNDEEIP DVLHPVTSLL RIGVGPVTAL EGWKEWRRGF FSLPLVARVT ISPGQSFPAE 
SRWHLVVSSG SYPADIFILP DKVAGPNLTF PHQAAVYSRD GKEPWLNGEP CLTDPTAAFG
DRHGSRPEPI ALADRLIWKV ERFSRWCELA AAGRLHNPGD HFELPPLSGH TNPMTIGFHE
TEGDLVRWTQ GAARAGIVHL VSVSSSSKVL AAQSWRTTED DLLHMPDWGL LIKNQSPGQI
PAIWFLLEEI PIIGAWQLPR TYRELSERLA GETIDLAKSL AVLGQSMRKR NISRPAVIMF
GFPIPRFFGD APSRIHWLAI CDVRLTRKNS ARNGFRSSEA NRQRFDRDLA RSAKALSWAT
CENWAPDQIR TRLRSAPGSQ PKMLLIGAGA LGSQVAETLM RTGVRDIDVR DRDELAAGNL
CRHALDLTMI AVNKAAALAA QLNRLQPDGR ARGDDRAFPT VGPDGPENAD DYDVILDCTG
ENKLLRSLSM IPWKSEKLFI SLSVNWGARG LMFWSSRGAS FPAVDATERL EALARKFRPE
GIDERFEGIG CWHPVFPADA ADIRIWAGMG ARYVLDQIEG GPEGCGIYFF NDDRTIGLEH
G
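
The gene sequence starts with TTG, yet the protein sequence starts with M. This is consistent with translation table 11 (bacterial, archaeal and plant plastid code), in which TTG is one of several alternative start codons and an initiating codon is read as methionine. The sketch below illustrates that relationship. It is not the pipeline that produced this record, and the names `translate_cds`, `CODON_TABLE`, and `START_CODONS` are hypothetical. The codon-to-amino-acid assignments of table 11 are the same as in the standard code; only the set of start codons differs.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        if index == 0 and codon in START_CODONS:
            amino_acid = "M"  # an initiating codon is read as methionine
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate_cds("TTGAAGTCGC CAAACGATGA GGAGATCCCG"))  # MKSPNDEEIP
```

The printed residues match the first block of the protein sequence above.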