Gene Information |
Locus tag | Rpal_4920 |
Symbol | |
ID | 6412611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 5298520 |
End bp | 5300148 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642714802 |
Product | NHL repeat containing protein |
Protein accession | YP_001993884 |
Protein GI | 192293279 |
COG category | |
COG ID | |
TIGRFAM ID | |
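The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 5298520
end_bp = 5300148

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (the record says "Is gene spliced | No")
# and ends in a stop codon, which does not contribute a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1629
print(protein_length)  # 542
```

Under these assumptions, both derived values agree with the "Gene Length" and "Protein Length" fields reported in the table.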
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.610425 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGGCA AACTTGCGCT GGGCGCGAGT GCGTTGACGA TCGGCTTGCT GACGCTGTCA GTCGCACACG CCGACGGCTA TCAGGTCACC AAGCTGGTCC CGGGCTCGGC GTTTCACGGC GTGCACGGCC TTGCGGTCGA CAAGGCCGGC AAGCTGTTCG CCGGCAGCGT CGCCGGCGCA GCGCTGTATG AAGTCGATCG CGCCGCAGGT ACCGCCAAGA TCGCCGTGCC GACGCCGGAA GGCATGGCCG ACGACATCGC GATCGCGCCG GACGGCACGA TGGCGTGGAC CGCCTTCCTC ACCGGCGATC TCTATGCGCG CAAGGGCGAC GGCCCGATCA AGAAGCTGGC GTCCGGCCTG CCGGGCATCA ACTCGCTCGC CTTCCGCAAG GATGGACGGC TGTACGCCAC CCAGGTGTTT CTCGGCGATG CGCTGTACGA GATCGACGTC GAGGGCGCCA AGCCGCCACG CAAGATCATG GAGAAGATGG GTGGGCTGAA CGGCTTCGAA TTCGGCCCCG ACGACAAGCT CTACGGCCCG CTGTGGTTCA AGGGCCAGAT CGTCAAGGTC GATGTCGACA AGGGCGAACT CAGCATCGTC GCCGACGGCT TCAAGGTGCC GGCAGCGGCG AATTTCGACT CCAAGGGCAA TCTGTGGGCG CTCGATACGG CGCTCGGTCA GCTGGTCAAG ATCGATCCGA AGACCGGCGC CAAGCAGGTC GCGGCGCAGC TCAAGCCGGC GCTCGACAAT CTCGCGATCG ATGCGAGCGA CCGCATCTTC GTCTCCAACA TGGCCGACAA CGGCATCCAG GAAGTCGATC CGGCGACCGG CGCGGCCAAG CAGGTGATCA TCGGCAAGCT GGCGTTTCCC GGCGGCATCG GCGTCGTTTC CGACGGCGGC AAGGACACCA TCTACATCGC CGACGTCTTC GCCTATCGCA CCGTCGATGG CGCCAGCGGC GAGGTGCGCG AAGTGGCGCG GATGCACGCC GACGGCACCA CGCTCGAATA TCCGATGAGC GCCACCGCCA AGGGCGACGA GGTAATCCTG TCGAGCTGGT TCACCGGCAC GGTGCAGACG ATCGACCGCA AGACCGGCCA GAGCCGCGAC ATGCTGCACG GCTTCAAGGC GCCTTACGAC GCGATCCGGC TCGGCGGCGG CAAGCTGCTG GTCGCCGAAC TCGGCACCAA GTCGCTGGTC GAAGTCTCGG GCGAGCACGG CAAGGACCGC AAGGCGATCG CCACCGATCT TGCAGGTCCG GTCGGACTGG TCCTCGGCAA AGACAGCGCG GTGTATGTCA GCGAAGCGTT CGCCGGCCAG ATCAGCAAGA TCGATCCGGT GACCGGCGCC AAGACGGTCG TCGCCAAAGA CCTGAAGATG CCCGAGGGCA TCGCGCTCGC GCCGTCCGGC AAACTGATCG TCGCCGAAGT CGGTGCCAAA CGCGTGGTCG AGGTCGATCC GGCCAGCGGC AGCGTGACGG AAATCGCCGG CAATCTGCCG ATCGGCCTGG TCGGCGCCCC CGGCCTGCCG CCGACCAACA TGCCGACCGG CGTCGGTGTC GGCGCTGGCG GCACGATCTA CGTGTCGTCC GATATCGAGA ATGCGATCTA CAAGATCGAG AAGAAGTAG
|
Protein sequence | MKGKLALGAS ALTIGLLTLS VAHADGYQVT KLVPGSAFHG VHGLAVDKAG KLFAGSVAGA ALYEVDRAAG TAKIAVPTPE GMADDIAIAP DGTMAWTAFL TGDLYARKGD GPIKKLASGL PGINSLAFRK DGRLYATQVF LGDALYEIDV EGAKPPRKIM EKMGGLNGFE FGPDDKLYGP LWFKGQIVKV DVDKGELSIV ADGFKVPAAA NFDSKGNLWA LDTALGQLVK IDPKTGAKQV AAQLKPALDN LAIDASDRIF VSNMADNGIQ EVDPATGAAK QVIIGKLAFP GGIGVVSDGG KDTIYIADVF AYRTVDGASG EVREVARMHA DGTTLEYPMS ATAKGDEVIL SSWFTGTVQT IDRKTGQSRD MLHGFKAPYD AIRLGGGKLL VAELGTKSLV EVSGEHGKDR KAIATDLAGP VGLVLGKDSA VYVSEAFAGQ ISKIDPVTGA KTVVAKDLKM PEGIALAPSG KLIVAEVGAK RVVEVDPASG SVTEIAGNLP IGLVGAPGLP PTNMPTGVGV GAGGTIYVSS DIENAIYKIE KK
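The relationship between the gene sequence, the translation table, and the protein sequence can be illustrated with a short sketch. It assumes the codon assignments of NCBI translation table 11, which are the same as those of the standard code (table 11 differs only in which codons are permitted as start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 and last 9 bases of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position fastest).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()

def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq):
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# First 30 and last 9 bases of the gene sequence listed above.
head = "ATGAAGGGCA AACTTGCGCT GGGCGCGAGT"
tail = "AAGAAGTAG"

print(translate(head))  # MKGKLALGAS
print(translate(tail))  # KK*
print(gc_fraction(head))
```

The translated fragments match the start (`MKGKLALGAS`) and end (`KK`) of the protein sequence, with the terminal `TAG` read as a stop codon. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.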