Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3755 |
Symbol | |
ID | 3911558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4290929 |
End bp | 4292086 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637885656 |
Product | hypothetical protein |
Protein accession | YP_487360 |
Protein GI | 86750864 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.824369 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCGACA TCGCGACTTC AACGCCGATC ACACGCGCCC CGCGAGGCGG GGCGGCGACA CGCGCGGTTT TGCCGGCGAT GGCTGCGATC GACGCCGGCG GCTGGCGCGA TCTCGCCGAC CGGGCGCTCG AACCGAATGC CTATTACCTG CCGGAATGGG CCATGGCCGC GAACGGCGAC GCCCGTGCCC GCGCCGTGAC CCACGCGCTG ACGGCCGATG ATGCGAGCGG GGCGCTGATC GGCCTGTTGC CGGTGCTGTC GGCCTGGCAA GCGTTTCGGT TGCCGCTGCC GGTGCTGGTC TCCGCCGATC CGTTCCGCTC GCTCGACACG CCCGTGCTCG ACCGCGGCGC TGCCGACCGC GCCGCCGCCG CAATGCTGGC TCAGGCCCGC GCGACCGGAG CCCACGCCTT GCTGCTGCGC GACGTCGCGC GCGACGGCGA AGCCATGCAG ACATTGTCGC GCGCCGCGGC TGCCGACGGC CTTTCGCCGG TGCTGCTGCG CGGTTGGTCG CGCGCCTGCC TCGACGCCAC GCGCGACGGC GACGAATTGC TGCGCGACGC GCTCGGCGGC AAGCGTCTCA AGGAATATCG CCGGCTGACG CGCCGGCTCG GCGACCACGG CGAGGCGCGC TTCAGCATGG CGCGAACGCC GGATGCGGTG GCCGAGGCCT ATGATCTGTT TCTCGCGCTG GAAGCCAGCG GCTGGAAGGG CCGCCGCGGC ACCGCGCTGA TGCACCAGCC GGAGCTGGCC GGGCGGCTGC GGCAAGCCGC GATCGCGCTG GCCGCGCGCG GCGCCTGCGA GATCGCGCTT TTGCACGCGG GCGCAGAGCC GATCGCCGCC GGCATCGTGC TGCGGCAAGG CGACCGCGCC TTCTTCTTCA AGCTCGGCAT CGCCGAAGGC TTCGCCCGGC ATTCGCCGGG CGTGCTGCTG ACACTGGAAC TGACGCGGCA TCTTTGCGCC GATCCGGCGA TCGCGATGGT CGATTCCACC GCTGCGCCCG ATCATCCGAT GATCGACCCG ATCTGGCGCG GGCGGCTGGC GATGGGCGAC GTGCTGATCC CGCTGCGATC GCGCGATCCG CTGTTCGGAC CGATCACGCT CGCCCTGCGC GCCCGCGAAG CGCTGCGCCA GACGGCGAAG CGCATGCTCA AGCGATAG
|
Protein sequence | MADIATSTPI TRAPRGGAAT RAVLPAMAAI DAGGWRDLAD RALEPNAYYL PEWAMAANGD ARARAVTHAL TADDASGALI GLLPVLSAWQ AFRLPLPVLV SADPFRSLDT PVLDRGAADR AAAAMLAQAR ATGAHALLLR DVARDGEAMQ TLSRAAAADG LSPVLLRGWS RACLDATRDG DELLRDALGG KRLKEYRRLT RRLGDHGEAR FSMARTPDAV AEAYDLFLAL EASGWKGRRG TALMHQPELA GRLRQAAIAL AARGACEIAL LHAGAEPIAA GIVLRQGDRA FFFKLGIAEG FARHSPGVLL TLELTRHLCA DPAIAMVDST AAPDHPMIDP IWRGRLAMGD VLIPLRSRDP LFGPITLALR AREALRQTAK RMLKR
|
| |