Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3331 |
Symbol | |
ID | 3911133 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 3811502 |
End bp | 3812563 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637885234 |
Product | biotin synthase |
Protein accession | YP_486938 |
Protein GI | 86750442 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0502] Biotin synthase and related enzymes |
TIGRFAM ID | [TIGR00433] biotin synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.334205 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAACTCGA TTGATCTGGC CTCACTGGCC GCCGCCACCC CCACCCTTCG CCACGACTGG ACACGCGAAC AGGCCGCCGC GATTTACAAC CTGCCGTTCG CCGACCTGAT CTTCCGGGCG CAGACCATCC ACCGCCAGAG CTTCGACGCC AACGAGGTGC AGTGCAATCA GCTGCTCAAC GTCAAGACCG GCGGCTGCGC CGAGGATTGC GGCTATTGCA GCCAGTCAGC GCATCACGAC ACCGCCCTGC CCGCCTCCAA GCTGATGGAC CCCAAGAAGG TCATCGAAGG CGCCAAGGCG GCTCGCGACG CAGGTGCCAC CCGCTATTGC ATGGGCGCGG CGTGGCGATC GCCGAAGGAT CGCGACATGG CGCCGGTGAT CGAGATGGTG AAGGGCGTCA AGGCGCTCGG CATGGAAGCC TGCATGACGC TCGGCATGCT GACGGACGAT CAGGCGCAGC AACTCGCCGA CGCCGGCCTC GATTACTACA ACCACAACAT CGACACCTCC GAGGAGTTCT ACTCCTCCGT GGTCAAGTCC CGCAGCTTCG GCGACCGGCT CGACACCCTC GCCACGGTGC AGGACGCCGG CATCAAGGTG TGCTGCGGCG GCATTCTCGG CCTTGGTGAG AAGCCGACCG ACCGCGTCGA GATGCTGCGC ACGCTCGCCA ACCTGCCGCA GCATCCGGAG AGCGTGCCGA TCAACATGCT GATCCCGATC GAGGGCACGC CGATCGCCAA GACCGCCAAG CCGGTCGATC CGTTCGAGTT CGTCCGCACC ATCGCGCTGG CGCGGATCAT GATGCCGAAG TCGGACGTGC GGCTGGCCGC CGGCCGCACC GCGATGAGCG ACGAGATGCA GTCGCTGTGT TTCCTCGCCG GCGCCAATTC GATCTTCATC GGCGACACCC TGCTGACCAC GCCGAACCCC GGCGACAGCA AGGACCGCAT GCTGTTCGCC CGGCTCGGCA TCACCCCGCG CGAGGGTGCC CCGGTCGAGG CGCACAGCCA CGACCACGAT CACGATCACC ACGACCATCA TCACGGCCAC AGCCACAGCT GA
|
Protein sequence | MNSIDLASLA AATPTLRHDW TREQAAAIYN LPFADLIFRA QTIHRQSFDA NEVQCNQLLN VKTGGCAEDC GYCSQSAHHD TALPASKLMD PKKVIEGAKA ARDAGATRYC MGAAWRSPKD RDMAPVIEMV KGVKALGMEA CMTLGMLTDD QAQQLADAGL DYYNHNIDTS EEFYSSVVKS RSFGDRLDTL ATVQDAGIKV CCGGILGLGE KPTDRVEMLR TLANLPQHPE SVPINMLIPI EGTPIAKTAK PVDPFEFVRT IALARIMMPK SDVRLAAGRT AMSDEMQSLC FLAGANSIFI GDTLLTTPNP GDSKDRMLFA RLGITPREGA PVEAHSHDHD HDHHDHHHGH SHS
|
| |