Gene Information |
Locus tag | RPB_3851 |
Symbol | |
ID | 3911655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4403661 |
End bp | 4404719 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637885752 |
Product | hypothetical protein |
Protein accession | YP_487455 |
Protein GI | 86750959 |
COG category | [S] Function unknown |
COG ID | [COG3181] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
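The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API, and the sketch assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record: Start bp 4403661, End bp 4404719.
length = gene_length(4403661, 4404719)
print(length, protein_length(length))  # prints "1059 352"
```

Both results match the Gene Length (1059 bp) and Protein Length (352 aa) rows.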
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCAGA TTCGAGCAAG CGACGCAGCC GAGATGCGGA CATGTCGAGG GGATAACCTG GGACGACTGC TGCACAGGGC GTGTTTCGGC CTGCTCGTCG CTGCGATCTT GGCGACGCCC TCGGGGGCCG GGCCGGCCTA TCCGGTGCGG GCCATCACGG TGATCGTACC GTTCGCCGCC GGGGGGCCGA CCGACATCGT CACCCATATC GTCACCGACC ATCTGGCAAA GCGGCTCGGG CAGCAGATCA TCGTCGAAAA CGTCGTCGGC GCCGGTGGTA CTACTGCGGC AATCCGCGCG ACGCGCGCCG CTCCGGATGG CTACACGCTG CTGATGGGGC ACATGGGAAC TCACGGCAGC GCGCTCGCGG CATACCCGCA ACTGGTCTAC GATCCCGTCA CGGACTTCAC GCCGATCGGG CTGGTGGTGA GAACGCCGGT TCTCGTGGTC GTCCGGCGCG GCCTGCCAGT GCACGATCTC GCTGATCTGG TGCGCTATTC GAGCAAGCAC GGCAAAGCTA TCACGATGGC ACACGCCGGT TTCGGCTCGA TCTCCCATGC GACCTGCACG CTGTTCAACA CGATCGGCGA CATTCATCCC GCAACGCAGG CGTTTCAGGG CACCGGCCCG GCACTGAAGG CGCTCATCGC CGGACGCGTC GACTATATGT GTGATCAGAT CGTCGGCGCG GCGCCGCGCG TCGGAACCGG AGAGGTGAAG GCGCTGGCGA TCTCCTCGGG CCGGCGAAGT CCGATCCTGC CGGACGTCCC GACCTCCGCC GAAGCCGGCT TCCCCGATTT CGAAATCTCA GCCTGGAGCG CGCTGTTCGC GCCGAAAGCA ACGCCGCAGC ACATCGTCGA TGGACTCAAC GCCGCGTTGG CTCACGCGCT CGACGATCCC GAGGTCCGCC AGGCGCTGGG CAGTCTCGGC GGCGAAATCC CGCTCGCAGA CGAGCGCAGC GCGTCCACGC TCGCCACGCT GATCGAACGC GAAGCCACGC GATGGCGGTC GCTGTCCCGA CCGGCCGCCG TATTGGACGG TTCGCGGGTA CGGCAATGA
|
Protein sequence | MNQIRASDAA EMRTCRGDNL GRLLHRACFG LLVAAILATP SGAGPAYPVR AITVIVPFAA GGPTDIVTHI VTDHLAKRLG QQIIVENVVG AGGTTAAIRA TRAAPDGYTL LMGHMGTHGS ALAAYPQLVY DPVTDFTPIG LVVRTPVLVV VRRGLPVHDL ADLVRYSSKH GKAITMAHAG FGSISHATCT LFNTIGDIHP ATQAFQGTGP ALKALIAGRV DYMCDQIVGA APRVGTGEVK ALAISSGRRS PILPDVPTSA EAGFPDFEIS AWSALFAPKA TPQHIVDGLN AALAHALDDP EVRQALGSLG GEIPLADERS ASTLATLIER EATRWRSLSR PAAVLDGSRV RQ
|
| |
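The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of that cleanup together with a GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown above.
example = clean_sequence("ATGAACCAGA TTCGAGCAAG")
print(example)               # ATGAACCAGATTCGAGCAAG
print(gc_fraction(example))  # 0.45
```

Applied to the full gene sequence, the same function gives a value that can be compared with the GC content row of the record.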