Gene RPB_3331 details


Gene Information

Locus tag: RPB_3331
Symbol:
ID: 3911133
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3811502
End bp: 3812563
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 66%
IMG OID: 637885234
Product: biotin synthase
Protein accession: YP_486938
Protein GI: 86750442
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0502] Biotin synthase and related enzymes
TIGRFAM ID: [TIGR00433] biotin synthetase
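
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is not counted as a residue; the function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record for RPB_3331.
length = gene_length_bp(3811502, 3812563)
print(length)                     # 1062
print(protein_length_aa(length))  # 353
```

Both results agree with the Gene Length (1062 bp) and Protein Length (353 aa) reported in the record.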


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.334205
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAACTCGA TTGATCTGGC CTCACTGGCC GCCGCCACCC CCACCCTTCG CCACGACTGG 
ACACGCGAAC AGGCCGCCGC GATTTACAAC CTGCCGTTCG CCGACCTGAT CTTCCGGGCG
CAGACCATCC ACCGCCAGAG CTTCGACGCC AACGAGGTGC AGTGCAATCA GCTGCTCAAC
GTCAAGACCG GCGGCTGCGC CGAGGATTGC GGCTATTGCA GCCAGTCAGC GCATCACGAC
ACCGCCCTGC CCGCCTCCAA GCTGATGGAC CCCAAGAAGG TCATCGAAGG CGCCAAGGCG
GCTCGCGACG CAGGTGCCAC CCGCTATTGC ATGGGCGCGG CGTGGCGATC GCCGAAGGAT
CGCGACATGG CGCCGGTGAT CGAGATGGTG AAGGGCGTCA AGGCGCTCGG CATGGAAGCC
TGCATGACGC TCGGCATGCT GACGGACGAT CAGGCGCAGC AACTCGCCGA CGCCGGCCTC
GATTACTACA ACCACAACAT CGACACCTCC GAGGAGTTCT ACTCCTCCGT GGTCAAGTCC
CGCAGCTTCG GCGACCGGCT CGACACCCTC GCCACGGTGC AGGACGCCGG CATCAAGGTG
TGCTGCGGCG GCATTCTCGG CCTTGGTGAG AAGCCGACCG ACCGCGTCGA GATGCTGCGC
ACGCTCGCCA ACCTGCCGCA GCATCCGGAG AGCGTGCCGA TCAACATGCT GATCCCGATC
GAGGGCACGC CGATCGCCAA GACCGCCAAG CCGGTCGATC CGTTCGAGTT CGTCCGCACC
ATCGCGCTGG CGCGGATCAT GATGCCGAAG TCGGACGTGC GGCTGGCCGC CGGCCGCACC
GCGATGAGCG ACGAGATGCA GTCGCTGTGT TTCCTCGCCG GCGCCAATTC GATCTTCATC
GGCGACACCC TGCTGACCAC GCCGAACCCC GGCGACAGCA AGGACCGCAT GCTGTTCGCC
CGGCTCGGCA TCACCCCGCG CGAGGGTGCC CCGGTCGAGG CGCACAGCCA CGACCACGAT
CACGATCACC ACGACCATCA TCACGGCCAC AGCCACAGCT GA
 
Protein sequence
MNSIDLASLA AATPTLRHDW TREQAAAIYN LPFADLIFRA QTIHRQSFDA NEVQCNQLLN 
VKTGGCAEDC GYCSQSAHHD TALPASKLMD PKKVIEGAKA ARDAGATRYC MGAAWRSPKD
RDMAPVIEMV KGVKALGMEA CMTLGMLTDD QAQQLADAGL DYYNHNIDTS EEFYSSVVKS
RSFGDRLDTL ATVQDAGIKV CCGGILGLGE KPTDRVEMLR TLANLPQHPE SVPINMLIPI
EGTPIAKTAK PVDPFEFVRT IALARIMMPK SDVRLAAGRT AMSDEMQSLC FLAGANSIFI
GDTLLTTPNP GDSKDRMLFA RLGITPREGA PVEAHSHDHD HDHHDHHHGH SHS
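
The gene sequence begins with TTG while the protein begins with M. This is consistent with translation table 11, in which TTG can act as an initiator codon. The sketch below shows one way to reproduce the translation and to compute GC content from the space-separated sequence blocks. It is a minimal illustration, not the database's own method: the helper names are hypothetical, and the set of initiator codons is my assumption of what NCBI lists for table 11.

```python
from itertools import product

# Standard amino-acid assignments in NCBI base order (T, C, A, G);
# table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumed initiator codons for table 11; each is read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS, dropping a trailing stop codon if present."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if codons[0] in START_CODONS:
        residues[0] = "M"  # initiator codon is translated as Met
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from the record.
first_line = "TTGAACTCGA TTGATCTGGC CTCACTGGCC GCCGCCACCC CCACCCTTCG CCACGACTGG"
print(translate_cds(first_line))  # MNSIDLASLAAATPTLRHDW
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `translate_cds` and `gc_percent` lets the result be compared against the listed protein and the reported GC content.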