Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3221 |
Symbol | |
ID | 3911022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 3683141 |
End bp | 3684328 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637885123 |
Product | hypothetical protein |
Protein accession | YP_486828 |
Protein GI | 86750332 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.978135 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.653443 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGGCA ATGCCGGCGC GGCGGATGCC GACGGCGAGC GTCTGGAAAT CGTCCGGACC GCCGAGCGGC TGCGCGAGAT CGGCCCGGCC TGGGAAGCGC TGTGGCACGA CGCCGGCGCG CTGGTGTTCC AGAGTCATGC CTGGACCGCC GCGTGGTGGA ACGCGGTGCC CGACCGCCCG CGCCGCGGAT TGTTCATCGT ACTGGCGTGG CGGCACGACA CGCTGGTGGC GGTGCTGCCG CTGGCGACCT GCCGCTGGTA CGGCGTCCGC GTGCTGGAAT GGGCCGCCAA GGACTACTCC GACTATTGCG ACGCGCTGCT GCGCCCGGGC ATCGGCCCGG CTGTGGTGCA GCGGATGTGG GCCCATGCCG ATGTGCAGGG AGGTTTCGAC GCCGCCTATC TCGGCCATGT GCTGCCGACC GCGATCGTGA ACACGCTGAC CGACGGAACG CGCGGCCGCG GCGTCGTGCT GCGTCCCCAC TTCCGGCAGG CCACGAGCCT GCGCGTGGTC GGCCCCTGGA GCAACAGCCA GGCTTGGTTC GACTCGCATT CCGGCAACGC GCGGCGCAAC TATCGCCGCG GTCTCAAGAC CCTTTCAGAC AACGCCAAGG TCGAATTCCG GCTGATGGCA CCGGACGAGC CGCTCGGGCC CGCCTTGCAG CGATGCGCCG AGCTGAAGCG CGCCTGGTGC GCCCGCAACG GCCTGGTGGC GCCGCTGTTC GATGCCGGTT CGCCGATGCT GGAAGCGCTG GTGCAGGTGC TCGCCGACAA CAAGCTGCTG CATGTGTTCG TGCTCGAGCG CGACGGCGTG ATCGTCGCCA TGACGGTCAA CCTGATGCAG CACGCCACCA TGATGGCCTA TGTCACCACT TACGATTCCA GTTTCGAACG CAGTTCGCCC GGCAACATCC TGCTGTTCGA CTACATCCGA TGGTCGATCG ATCACGGCGC GACGACCGTC GATTTCCTGT GCGGCGACGA GGACTACAAA TATCGCTTCA GCAACCAGCA GGTCACCCTG AACTCGTTCG CGGGGGGCCG CACGCTGCTG GGCAAGGCGG CGATCCTGGC GGACAAGGCG CTGCACGCCG TCAACGCCTT CCGCGCGCGA TCGCTGAACC GCCCGTCGAA GTCCGCGGCG AAGCCGGACG ATCGTGGCGC CCTCGGTGCG CCTGTCGGCG AACCCTAG
|
Protein sequence | MIGNAGAADA DGERLEIVRT AERLREIGPA WEALWHDAGA LVFQSHAWTA AWWNAVPDRP RRGLFIVLAW RHDTLVAVLP LATCRWYGVR VLEWAAKDYS DYCDALLRPG IGPAVVQRMW AHADVQGGFD AAYLGHVLPT AIVNTLTDGT RGRGVVLRPH FRQATSLRVV GPWSNSQAWF DSHSGNARRN YRRGLKTLSD NAKVEFRLMA PDEPLGPALQ RCAELKRAWC ARNGLVAPLF DAGSPMLEAL VQVLADNKLL HVFVLERDGV IVAMTVNLMQ HATMMAYVTT YDSSFERSSP GNILLFDYIR WSIDHGATTV DFLCGDEDYK YRFSNQQVTL NSFAGGRTLL GKAAILADKA LHAVNAFRAR SLNRPSKSAA KPDDRGALGA PVGEP
|
| |