Gene Rsph17029_2250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2250 
Symbol 
ID4896738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2383628 
End bp2384674 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content74% 
IMG OID640112844 
ProductUBA/THIF-type NAD/FAD binding protein 
Protein accessionYP_001044125 
Protein GI126463011 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0655249 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.413698 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGCGC TGGTTCTTCT GGCCGTGGTG CTCTGGGGGC TGGGCTGGGC CTTCGGCGTG 
CCGGGACGTC TCCGGCTGCT GATGCTGGCC TGTCTCTGGT TCGCGGTGGT GCTGGCCCAT
CTGCTGCTGC CCGAGGCCCA TCCCTTGCGG GCGGCCACCG GCGGCCGCGC CGAGCCCTGG
CTGGTGCTGG GCGGGGTTGC GGTGCTGGGG CTGGGCTACC GGCTGGCGCT CGGGGCCCTG
CGGCGGCGCG CGGCACCCGT TCGGCCGGCC GCGGCGCCGG GCGCCTTCCG GCCGGCCGAG
CTCGAGCGCT ATGCGCGCCA TATCCTGCTG CGCGAGGTGG GGGGGCCGGG ACAGAAGCGG
CTGAAGCAGG CGCGCGTGCT GGTGGTGGGC GCCGGGGGGC TGGGCTCGCC CGCCCTGCTC
TATCTCGCGG CGTCGGGAGT GGGGACGGTG GGGGTGATCG ATGCCGACCA GGTCGAGGCC
TCGAACCTGC AGCGGCAGGT GATCCATACC GATGCGCGGA TCGGCTGGCC CAAGGTCCAT
TCCGCGGCCG AGGCGATGCG GGCGCTCAAC CCCTTCATCG AGGTGCGGCC CTACGAGCGT
CGGCTGACCG AGGAGAATGC GGCGGCACTG CTGGCCGACT ATGACCTGAT CCTCGACGGG
ACCGACAATT TCGACACGCG CTATCTCGTC AACCGGGTGG CGGTGGCGGC GGGCAAGCCG
CTCATCGCGG GCGCCATCGC GCAGTGGGAA GGGCAGGTGA GCCTCTACCA TCCGGCGGCG
GGCGGGCCCT GCTTCCAGTG CACCTTCCCC GAGCGGCCGG CGCCGGGCCT CGTGCCCACC
TGCGCCGAGG CGGGTGTGAT CGCGCCGCTG CCGGGCGTGG TGGGCTCGAT CATGGCGGTC
GAGGCGGTGA AGCATCTGAC CGGCGCGGGA GCCACGCTGC GCGGTGCGCT GCTGATCTAC
GATGCACTCT GGGGCGAGAC GCGGCGGATC GGGCTGAAGC CGCGCCCCGG CTGCTCGGTC
TGCGGCGGCG CGGGCAAGGC CGGCTGA
 
Protein sequence
MLALVLLAVV LWGLGWAFGV PGRLRLLMLA CLWFAVVLAH LLLPEAHPLR AATGGRAEPW 
LVLGGVAVLG LGYRLALGAL RRRAAPVRPA AAPGAFRPAE LERYARHILL REVGGPGQKR
LKQARVLVVG AGGLGSPALL YLAASGVGTV GVIDADQVEA SNLQRQVIHT DARIGWPKVH
SAAEAMRALN PFIEVRPYER RLTEENAAAL LADYDLILDG TDNFDTRYLV NRVAVAAGKP
LIAGAIAQWE GQVSLYHPAA GGPCFQCTFP ERPAPGLVPT CAEAGVIAPL PGVVGSIMAV
EAVKHLTGAG ATLRGALLIY DALWGETRRI GLKPRPGCSV CGGAGKAG