Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0831 |
Symbol | |
ID | 3909089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 944818 |
End bp | 947358 |
Gene Length | 2541 bp |
Protein Length | 846 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637882724 |
Product | hypothetical protein |
Protein accession | YP_484453 |
Protein GI | 86747957 |
COG category | [R] General function prediction only |
COG ID | [COG4867] Uncharacterized protein with a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | [TIGR02302] conserved hypothetical protein TIGR02302 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.844423 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGCGGAC GCAGTCCCGA CCCGTCACAG GCCCCGCGCG ATCCAGACGC GACCGCGCGG CTGCGGCTGG CCACGGCTCT GCAGCGGGCG ACGGTCGCGA TCGCCTGGGA GCGGAGTTGG CCGCTGCTGG TCCGGCTGCT GAGCGTCGTG GGCCTGTTTC TGGCGGCGTC CTGGGCCGGG CTGTGGCTGG CGCTGCCGTT CACCGGGCGC ATCGTCGGCC TGGTGCTGTT CGCTGCGCTG GCGCTGGTGG CGCTCTACCC CGTCCTCAAG TTCCGCTGGC CGAGCCGCGA CGAGGCGCTC GTCCGGCTCG ACCGCAACAC CGGGCTGAAG CATCGGCCCG CAACAGCGCT GACCGATACG CTGGCGTCGA GCGATCCGGT GGCGCAGGCG CTGTGGCGGG CGCAACGCGA GCGCACGCTG GCGGCGCTCC AGGGCATTCG CGCCGGGCTG CCGGCGCCGC GGCTGCCGAA GCACGATCCA TGGGCGCTGC GCGCCCTGGT CGCGGTGCTG CTGGTCGCCA CCTTCATCGC CGCCGGCGAG GAACGGACCG CGCGCGTCGC CGCGGCGTTC GACTGGAACG GCGCGCTGGC CGCGCCCAAC GTCCGGGTCG ATGCCTGGGT GACGCCGCCG GTCTACACCA ACAAGCCGCC GATCATCCTG TCGGCCGCCA ACAAGGATCT CGCCGCGCAG AACCAGGCGG CGCTGCCGGT GCCGGCCGGC TCGACGCTGC TGGTGCGATC CAGCGGCGGC GCGCTCGACG TCGCGGTGAC GGGCGGGATT GTCGAAGCCC AGCCGGAGGG CGAGGCGCCG GCGGGCACCA GCGAGCGGCA TTTCAAGATC ACCGGCGACG GCACCGCGCA TGTCCGCGCC CCGTCCGGCC AGCCGCAATG GGCGTTCAAG GTGACGACCG ATCACGCGCC GTCGATCGCT CTGGCCAAGG AGCCGGAGCG GCAGGCGCGC GGCTCGCTGC AATTGTCCTA CAAGCTCGAA GACGATTACG GCGTCACCGA AGCCCATGCG CTGATCGCCC CGTCGCCCTC CGCCGCCGCG CAGACCACCG AGCCGCCACG GCCGCTATAC GAACCGCCGC ATTTCGCGCT GACGCTGCCG AATGCGCGCA CCCGCGCCGG TGTCGGCCAG ACCGTGAAGG ATCTCAGCGA AGATCCCTAT GCGGGCGCCG AAGTGACGCT GACGCTGACC GCCAAGGACG AGGCCGGCAA TGAGGGCCGC AGCGAACCGC ATAAGATGCG GCTGCCGGAG CGGCTGTTCA CCAAGCCCTT GGCGCGGGCG CTGATCGAGC AGCGCCGCAT CCTGGCGCTC GACGCCACCA GGAACGCGCA GGTCTACACC GCGCTCGACG CGCTGATGAT CGCGCCGGAA GCGTTCACGC CGGACGCCGG CCAGTATCTC GGTCTCTACA CCGTCGCCGA CCAGCTCGAG CGCGCTCGCA CCGACGACGC GCTGCGCGAG GTGGTGGCCA GCCTGTGGTC GCTCGCGCTG GCGATCGAGG ATGGCGATAC GTCCGACGTC GAGAAGGCGC TACGCGCCGC GCAGGATGCG CTGAAGCAGG CGCTGGAGCG TGGCGCCTCC GACGAGGAAA TCAAGAAGCT CACCGAAAAT CTGCGCGCCG CGCTCGACAA TTTCATGCGC CAGCTCGCCG AGCAGATGAA GAACAATCCG CAGCAACTCG CCCGCCCGCT CGATCCGAAC ACCCGGGTGA TGCGGCAGCA GGACCTCAAC AACATGATCG AGCGCATGGA GCGGCTGTCG CGCTCCGGCG ACAAGGATGC CGCCAGGCAA TTGCTCGAAC AGCTCGCCCA GATGCTCGAA AACCTGCAGA TGGCGCAGCC CGGCCAGGGC GGCGACGACG ACATGCAGCA GTCGATGAAC GAGCTCGGCG ACATGATCCG CAAGCAGCAG CAACTGCGCG ACAAGACCTT CAAGCAAGGC CAGGATCAGC GCCGTGACCG GATGCGCGGC CAGAACGGTG AGCAGAGCCT CGGCGATCTG CAGCAGGATC AGCAGAACCT GCAGGAGCGG CTGCGCAAGC TGCAGCAGGA ACTCGCCAAG CGCGGGATGG GGCAGCAGGG CCAGCGCGGC CAGAACGGCG AGCAGGGCCA GCAAGGCGAG CAGGGCGAGG GCGGTCTCGA CCAGGCCGAG TCGGCGATGG GCGACGCCGA AGGCCGGCTC GGCGAAGGCA ATGCCGACGG CGCCGTCGAT TCCCAGGGCC GTGCGCTCGA TGCGCTGCGC AAGGGCGCGC AGAAGCTGGC CGAAGCGATG CAGCAGGGCG ACGGGCAGGG CCAGGGCGAT GGCCCGGGCA GCCGTCCCGG CCGGCAGCAG AGCAGCGGCA ACAACACCGA TCCGCTCGGT CGGCCGTTGC GCGGCCGCGA ATTCGGCGAC GATCTCACGG TGAAGATTCC CGGCGAAATC GACGTCCAGC GCGTCCGCCG CATCCTCGAA GAACTCCGCC GCCGCCTCGG CGATTCGGCC CGGCCGCAGC TCGAGCTCGA CTACATCGAG CGGCTGCTGA AGGATTATTA G
|
Protein sequence | MSGRSPDPSQ APRDPDATAR LRLATALQRA TVAIAWERSW PLLVRLLSVV GLFLAASWAG LWLALPFTGR IVGLVLFAAL ALVALYPVLK FRWPSRDEAL VRLDRNTGLK HRPATALTDT LASSDPVAQA LWRAQRERTL AALQGIRAGL PAPRLPKHDP WALRALVAVL LVATFIAAGE ERTARVAAAF DWNGALAAPN VRVDAWVTPP VYTNKPPIIL SAANKDLAAQ NQAALPVPAG STLLVRSSGG ALDVAVTGGI VEAQPEGEAP AGTSERHFKI TGDGTAHVRA PSGQPQWAFK VTTDHAPSIA LAKEPERQAR GSLQLSYKLE DDYGVTEAHA LIAPSPSAAA QTTEPPRPLY EPPHFALTLP NARTRAGVGQ TVKDLSEDPY AGAEVTLTLT AKDEAGNEGR SEPHKMRLPE RLFTKPLARA LIEQRRILAL DATRNAQVYT ALDALMIAPE AFTPDAGQYL GLYTVADQLE RARTDDALRE VVASLWSLAL AIEDGDTSDV EKALRAAQDA LKQALERGAS DEEIKKLTEN LRAALDNFMR QLAEQMKNNP QQLARPLDPN TRVMRQQDLN NMIERMERLS RSGDKDAARQ LLEQLAQMLE NLQMAQPGQG GDDDMQQSMN ELGDMIRKQQ QLRDKTFKQG QDQRRDRMRG QNGEQSLGDL QQDQQNLQER LRKLQQELAK RGMGQQGQRG QNGEQGQQGE QGEGGLDQAE SAMGDAEGRL GEGNADGAVD SQGRALDALR KGAQKLAEAM QQGDGQGQGD GPGSRPGRQQ SSGNNTDPLG RPLRGREFGD DLTVKIPGEI DVQRVRRILE ELRRRLGDSA RPQLELDYIE RLLKDY
|
| |