Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3532 |
Symbol | |
ID | 3911334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4042224 |
End bp | 4044602 |
Gene Length | 2379 bp |
Protein Length | 792 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637885434 |
Product | hypothetical protein |
Protein accession | YP_487138 |
Protein GI | 86750642 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.263961 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAATAC CCACCAACTT TGACCCATCG AGTTCAGTTC TATTCCTCGG TTCGGGATTT AGCGCAAGTG GGACGGCAAT CTATGGCGGA CATCCACCGG CAGGAGACCA ATTGTGCGAC CTGTTGGCGG ACGAACTGAA AGTGGCTCGC GGGAAATACG ATTTGCAGGC GCTCGCAGAC GCCTTTCGCA GGCGACCTGA GCTGAATATG TATCAATCAC TGCGTCGGCT ATTTACGATA AGCAATCTGA GCGTCGATCA GCGCGAAATT TTGAAGCTTC GCTGGCAACG AATTTATACG ACGAACTACG ATGATAGTGT AGAACTGGCG TTTCATGAGA ACGGCATCAA GTCGCCTTCT TACAACTACG ACGACCCGAA GCCCGCACGC GTTCCACGCG GTGCTGTGAT TCACCTTCAC GGTGTGATCA AGAAAGCCAC CGAAGAGAAT ATTCACCAAC AGCTCGTGCT CGGGCGGCAG TCCTATATAA GGCAATTCTT TGCGAAATCT CCTTGGTATG ATGAATTCTT GCGAGACATA AGATTTTGCG AAGCCATTTT TTTTGTTGGC TACAGCCTTG CTGATCCACA CGTCACAGCT CTTTTCGTTA ATCCCGAGCA GTCCAAACTC CGGACTTATT TCGTGCTTCG ACCGCCTCTG GACTCCCTAT TGGTGGAGCA CATCGAGGAG TATGGAGAAG CCCATCCGAT AGAAACGAAA GGGTTTGCTC AGATATGTCG GTCGCTTAGC GCGCCACCGC CGCTTGCAGA CCTAAATAAT CTCCGAATTC TTCGTTGGAT CGATCCATTC AAAGATCAAA AGACAGTAAT TCAACCGACC TCGCTGGAGG TAATCAACTT AGTGGCCTTC GGTGCTTTTG ACTCACAGCG TGCTTTATCA ACTCTGCCCA ATGGGAAGTA TGTAATACCT CGCCAGAAGA TGGCGCGGCG GGCTGTTCAG CAAGTGGTAC AAAATCGCAC GACGCTTCTT CATGGCCGCC TGGGAAATGG GAAATCTATA TTCCTTTGGA TACTCGCCTT TCATCTGATG GCTCTAGATT ATCAATGTTT CCGATGCAGT GCCATGTCGC CAGCCATTGA CCGCGAGGCT AAAGCGCTAG TGGATCATCC AAAAGTGGCG ATTCTATTTG ACAGTTATGA TGTTGCGATT GATTCCGTCG ATCGCCTGTA TGAGCTTTTG CCGCACGCGC GATTTATAGT TTGCGTGCGG AGCGGCGTTC AGGACGTTAG GCTGCACGAA ATTGCAACGC GGTTTCCGTC GTCGATTGCG CGCGTGAATC TCAATGAGTT TGATGCCGAA GATCGTTCGG ACTTCATTGA TCTCTTGGTG CCGGCCGGAG CTCTTAAGGA CGATCTTGAA AATAGGATAC GCAGTTGTGC CGACATCCGT GAAGTCGTTG CCACGATCTA CGAGAACGAA TTCATTCAGC GTCGGATTCG AGAATCGCTA GCTCCGCTGC GCTCTGATCG ATCTGCTGCA GCGGTCACCA TTCTGGGATT GCTGTTATCT TGGATCAATC AGACTGGTGA TCCATCGTTA TATCTTGAAG CGCTCGATGC GGACCCGCAC ACCACGCTTG CAAAATACCG AGAGGTCGCC ATCGATATTT TTCGGCTTGA CGATGATCAG ATTCAAGCAA GATCGCCGGT GTTTTCGGAT TACATTCTCC GTCGTCTTTT CTCGGTTGAT GAGATATTCC CCGTTGTCGA GAAGGTATTG ATCGCGGCCG TTCAGCGCAA AAAGGAAAGA AAATACCGTG CGATACTAAG TAATATAATG CGTTACTCAG CGCTCTTGAG CTTGTCCAAA GAGGCTCCCG ATGGCGCGAA CAAGATCATT GGTTTGTATG GCCGACTGCA GCGAGATGTC GGCATTCAAG AGGAGCCGCT GTTTTGGTTG CAGTACGCAA TCGCTATGAC TGAAGCGGAT TCAGCCGAAA TTGCGGAAGG TTTTCTTAGG ACGGCGTACC GCAAGGCCGC CGAAGCTGGA GATTTTGCGA CCTATCAGTT GGACACTTTT GCCCTTCGCC TGTATTTGAA GCTGGAGGAA AAGGCTGAAG TGGGTAGGTC TGTAAGCCGT ATTAAGATGA TTCTGTACTC GACTAAATTA GTTTCGGGAA TGATTGGTGA TCAAAACCAT CGGGCTTATG CAGTAAGAGT CCTAGAGGGC TGGTTGCCGT TTGTCGCTTC AAGGGTTGCG GATCTCACCG GTTCGCAGAA AACGAAGTGC TTGGCTGCGG TGGACGATCT GTTGCATAAG ATTTCGGGGC TGAGCGCTGC AGTTAGAGCG GAGACAGGAT CTGACCAAGT TAAGTCCGAC CTTGAAGCTG CTAAGCGAAC GTTGCTGCTC GGAGCATAG
|
Protein sequence | MPIPTNFDPS SSVLFLGSGF SASGTAIYGG HPPAGDQLCD LLADELKVAR GKYDLQALAD AFRRRPELNM YQSLRRLFTI SNLSVDQREI LKLRWQRIYT TNYDDSVELA FHENGIKSPS YNYDDPKPAR VPRGAVIHLH GVIKKATEEN IHQQLVLGRQ SYIRQFFAKS PWYDEFLRDI RFCEAIFFVG YSLADPHVTA LFVNPEQSKL RTYFVLRPPL DSLLVEHIEE YGEAHPIETK GFAQICRSLS APPPLADLNN LRILRWIDPF KDQKTVIQPT SLEVINLVAF GAFDSQRALS TLPNGKYVIP RQKMARRAVQ QVVQNRTTLL HGRLGNGKSI FLWILAFHLM ALDYQCFRCS AMSPAIDREA KALVDHPKVA ILFDSYDVAI DSVDRLYELL PHARFIVCVR SGVQDVRLHE IATRFPSSIA RVNLNEFDAE DRSDFIDLLV PAGALKDDLE NRIRSCADIR EVVATIYENE FIQRRIRESL APLRSDRSAA AVTILGLLLS WINQTGDPSL YLEALDADPH TTLAKYREVA IDIFRLDDDQ IQARSPVFSD YILRRLFSVD EIFPVVEKVL IAAVQRKKER KYRAILSNIM RYSALLSLSK EAPDGANKII GLYGRLQRDV GIQEEPLFWL QYAIAMTEAD SAEIAEGFLR TAYRKAAEAG DFATYQLDTF ALRLYLKLEE KAEVGRSVSR IKMILYSTKL VSGMIGDQNH RAYAVRVLEG WLPFVASRVA DLTGSQKTKC LAAVDDLLHK ISGLSAAVRA ETGSDQVKSD LEAAKRTLLL GA
|
| |