Gene RPD_0035 details

Gene Information

Locus tag: RPD_0035
Symbol:
ID: 4020489
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 40554
End bp: 41921
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 66%
IMG OID: 637960211
Product: UBA/THIF-type NAD/FAD binding fold
Protein accession: YP_567176
Protein GI: 91974517
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
TIGRFAM ID:
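
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; both assumptions reproduce the listed values but are not stated in the record. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record for RPD_0035.
length = gene_length_bp(40554, 41921)
print(length, "bp")                      # 1368 bp
print(protein_length_aa(length), "aa")   # 455 aa
```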


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.292167
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAGG CCACGCAGCA GAATGCCATG ATGCTCGCTT CCCTGCTCGG CGTCGGCGAG 
GCGGAAGCCG GCGAACGCCT GGCGCGAACC GTACTGATCA CGGCGGCCCC GGGATGGAAA
TCTGGCTGGG CCGTTGAGGT CGGCGAGCTT ATCGGTCGCA CCGTCCAGGT GTCGCACCAG
CAGGAACCCA CCGATCCGGA CCTGGAGTTG GTGATCGGCG ATGTGACCCC GCGAACGTCG
GCTCGGCGCG TGTATGCTGA CCTCGGCTCC GAAGGCGCGG CCGCTTCCCT CGAACCTGTC
GCGAAGCTGG CCGGAGAGCC CCACGGTCTC TATGCGGCGG CCGCCGCTTG TGCGGTATCT
GCCGTTGTAG TCCATGCCGT GATCGACGCG GCCGATCTGC CGCAGGCCCG ACTACCCATG
CGGCTGGATT ACGCGCAGCT TGGCGTCCCG AATGGCGCCC TCGACCTACG GGTCGACGTA
GGCCACGCCG TGATGGCGGG CGCCGGGGCC GTCGCCCATG CTTTTCTCAA GGCGGCCCGT
CACATCGATA TTCACGGCGA TCTCGCGATT GTCGACCCCA AGGTGGTGCA AGGTGGCATC
CTGAACCGCT GTCTCTACCT TGAGGACAAC GACGTCGACC GTCAGAAGGC CGAGGTCTTG
GCGGAGCGCG CGCAACGCGA TTTCCCGCAT TTGCGGCTGC TGCCGTTCGT AACCGATTTC
AAAGCGTACG TCCGTCAGCT TGGGCATCCC CCCGAGACCG TGTTCGTGAC GGTGGATAGC
CGGCTCGTCC GGCGCTCGAT CCAACTCGAG GTGCCCCGGC GCATCATCGA CGCGTCGACG
ACCGACGCCA GCGGCGTGAT CGTCCATTCG AACGTTCTTC CCACGCAGCA CGCCTGCCTC
GCATGCATCT ATCGGCACGT TCCGGAGGAG CACGCCCGCG AACGATCGAT CGCGGAGGGG
CTCGGCGTCG ATTTGGCCGA CGTTCAAGCC GGCCTGATCA CCGCCGAGGT GGCCCGACGG
ATCGTGCGGA CGCACAAATC GATTGATGGC GATGCGATCG TCGGTCTGGC CTTCGACAGC
CTGTTCCGGC AGCTGTGCTC TGAACAGGCG CTCGCCACGC CGGAAGGGCG GCAGGTCCTG
GCACCATTTG CGTTCGTCTC CGCTTGGGCG GGCGTGATGA TGGCAGTGGA GATGCTGAGG
TCGTTCGCCG GCGCCGCGAA GACCAACTAT TGGTCCGTCG ACCCTTGGAA TACGCCGAAG
GCGCGGGGGC GGATGCTCCG CCAGCGACAC CCGGAGTGCC AATTCTGCTC GAAGCCCGAG
TACGAACCGA TCATTCAGTC CCTGTGGGGA GAGCTCGCCG AGGCGTGA
 
Protein sequence
MNKATQQNAM MLASLLGVGE AEAGERLART VLITAAPGWK SGWAVEVGEL IGRTVQVSHQ 
QEPTDPDLEL VIGDVTPRTS ARRVYADLGS EGAAASLEPV AKLAGEPHGL YAAAAACAVS
AVVVHAVIDA ADLPQARLPM RLDYAQLGVP NGALDLRVDV GHAVMAGAGA VAHAFLKAAR
HIDIHGDLAI VDPKVVQGGI LNRCLYLEDN DVDRQKAEVL AERAQRDFPH LRLLPFVTDF
KAYVRQLGHP PETVFVTVDS RLVRRSIQLE VPRRIIDAST TDASGVIVHS NVLPTQHACL
ACIYRHVPEE HARERSIAEG LGVDLADVQA GLITAEVARR IVRTHKSIDG DAIVGLAFDS
LFRQLCSEQA LATPEGRQVL APFAFVSAWA GVMMAVEMLR SFAGAAKTNY WSVDPWNTPK
ARGRMLRQRH PECQFCSKPE YEPIIQSLWG ELAEA
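
The gene and protein sequences above are printed in space-separated blocks of 10. A minimal sketch of how such a block could be cleaned up, checked for GC content, and translated is shown below. It uses only the Python standard library. The helper names (clean, gc_fraction, translate) are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code. Table 11 also permits alternative start codons; that is not handled here, which does not matter for this gene because it starts with ATG.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11
# (bacterial) uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks plus one base of the gene sequence (7 codons).
print(translate("ATGAACAAGG CCACGCAGCA G"))  # MNKATQQ
```

Pasting the full gene sequence into translate and comparing the result with clean applied to the protein sequence is one way to confirm that the two blocks agree; gc_fraction on the full gene sequence can likewise be compared with the GC content field.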