Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_3869 |
Symbol | |
ID | 6411549 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 4157215 |
End bp | 4158255 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 642713751 |
Product | hypothetical protein |
Protein accession | YP_001992842 |
Protein GI | 192292237 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.649147 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGTCG AGATCTCAAA TCCGGCGGTG ATGATCGTCG ATCCGGCGCG GACCAAGGTC GCCGGCGCGA TCAAGCAGGC CGCCGGCGCC ACCGGCACCA GCTTCCAGTA TCTGCTCGCC ACCGCCAAGA TGGAGTCGGA CTTCAACCCG ACCGCCCAGG CCACCACCTC GTCGGCGCAG GGTCTGTTCC AGTTCATCGA GCAGACCTGG CTCGGCACCG TGAAGGAAGC CGGCGCGCAG CTCGGCTACG GCCAATATGC CGATTCCATC ACCCGCTCGG CGTCCGGCAG CTATTCGGTC AGCGATCCGT CGGCGCGCGC CGCGATCATG AATCTGCGCA ACGATCCGGT GGCGTCCTCG GCGATGGCGG GGGTGCTGAC CCAGTCGAAT AGCTTCAAGC TGACCGGCGA GATCGGCCGC CGTCCGTCGG ATGCCGAACT CTACATGGCG CACTTCATGG GCGTGTCCGG CGCCGCCAAG CTGATCAACG CCGCGAGCGA CACCCCCAAC GTTGCCGGCG CCGCGCTGTT TCCGTCGGCA GCGGCCGCCA ATCAGTCGAT CTTCTACGAT CGCTCCGGCA ACGCCCGCAG CGTCTCCGAG GTGTATTCCA ATCTCGCCAC GCGCTACGAG GCGGCCGCCA ATGCACCCGC AACGCAGAGC GCAATCGCCT CGGTCGCCGG CCTGCCGGTG ACGCTGGCTT CGGCCGCACC GCAAGTGCCG GTCGACAACG CCGCGTATCT GGCGAGCTTC CCGGACGTGC GCAACGTCAC GCCCGCGCAA GCCGGTGACG CGACCCGCAC CGCGACGTCG CAGCGCGCCA ACGAACCGAT GTTCCGCTCG CTGTTCCTCG GCGGCGACCG CGCGGAGCCG GTATCGCCCG CCGTGCAATC GCTGTGGACC TCGCCGTCGC AGACGACCAT GCCGACGCAG ACGACGATGC CGGCTCAAAC CGCGCTGCCC GATTTGCCCC GGACTCCCGA AGTCCGCACT CCGACCCCGC TCGACCTGTT CAGCGACCGC AACGGCACGT TCGCGAGCTG A
|
Protein sequence | MAVEISNPAV MIVDPARTKV AGAIKQAAGA TGTSFQYLLA TAKMESDFNP TAQATTSSAQ GLFQFIEQTW LGTVKEAGAQ LGYGQYADSI TRSASGSYSV SDPSARAAIM NLRNDPVASS AMAGVLTQSN SFKLTGEIGR RPSDAELYMA HFMGVSGAAK LINAASDTPN VAGAALFPSA AAANQSIFYD RSGNARSVSE VYSNLATRYE AAANAPATQS AIASVAGLPV TLASAAPQVP VDNAAYLASF PDVRNVTPAQ AGDATRTATS QRANEPMFRS LFLGGDRAEP VSPAVQSLWT SPSQTTMPTQ TTMPAQTALP DLPRTPEVRT PTPLDLFSDR NGTFAS
|
| |