Gene Information |
Locus tag | RPB_2661 |
Symbol | |
ID | 3910454 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 3043165 |
End bp | 3045486 |
Gene Length | 2322 bp |
Protein Length | 773 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637884561 |
Product | lipopolysaccharide biosynthesis |
Protein accession | YP_486274 |
Protein GI | 86749778 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.305468 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTTCG CGTTCTGGCG TCGGACGAAG GGCGAGACGA CGAGCAATGA ACCGGCTTCG CCCGCGCATC CGGCCGCCGC TACGGCCTCG AAGGCAGAGC GTATCAAGGC CGCGGCGAAG CAGCCGGTCT TCAACAAGTC TTCGCCCGAC GAGCCGGTCA TCGTGCGGAG GACGCCGGCG CCGATCGAGC CCCCGACCAA CGGCGATATC GACCTGCGGC TGATCGGGCA GGCGCTGGCG CGCAAGAAGC ACCTGATCAT CGCACCGACA TTGCTGGCGC TGGTGCTGTC GCTCGCCATC GTCAATCTGA TCACGCCGCG TTACAAATCT GAAGCCCGCA TCCTGATCGA CGGCCGCGAA AACATCTTCC TGCGCCCGAC CGGCGAACGC GACGAGCAAC GCAATGCGGT CGATCCGGAA GCAGTCACCA GCCAGGTGCA GTTGCTGCTG TCGCGCGAGC TCGCGCTCGA AGTCATCAAG CAGAACAAGC TGGCCGAGCG TCCCGAATTC GATCCCGTCC TGAAGGGCAT CAACCCGCTG AAGTCTCTTT TGGCGATGGT CGGTATCGGC CGCGATCCGT TCTCCATGAC GCCGGAAGAG CGCGTTCTCG ACGCGTATTA CGAGCGGCTC ACCGCCTATG CGGTCGACAA GTCGCGCGTG ATGGTCGTCG AATTCCAGTC GCAGGATCCC GAGCTCGCGG CGCGCGTCGC CAATTCCATC GCCGACGACT ATCTGGTGCT GCAGCAGAAC GCGCGCCAGG CGCAGGCGCG GTCCGCGGGG CAATGGCTGT CCGGCGAGAT CGAATCGTTG CGCAAGAAGG TCGCCGAGGC CGAATCCAAG GCCGAGGATT TCCGATCGAA ATCGAGTCTG TTCATCGGGA CCAACAACAC CACGCTGTCG AACCAGCAGC TCGGCGAGCT CAACACCCAG CTCGGCAATG CGCGCGCCCT GAAATCCGAC GCCGAGTCCA AATCACGGCT GATCAAGGAG ATGCTGCAGG GCGGTCGTCC GATCGAAGTG TCGGACGTGC TGAACTCCGA TGTGATGCGG CGATTGTCGG AGCAGCGTGT GATGCTGCGC ACGCAGCTCG CCGAACAATC GTCGACGCTG CTCGACAATC ACCCGCGGAT CAAGGAGCTG AGGGCGCAGC TCGCCGATCT CGACCGGCAA TTGCGCGACG AGGCGATGAA GCTGTCGCGT TCGTTCGAGA GCGACGCGCG GATCGCGAGC GGACGGGTCG ACAGCCTGAT CGCCAGTCTC GAGCAGCTGA AGAAACAGGC GTCTTCGACC AATGGTCAGG ACGTCGAACT GCGCGCGCTC GAGCGCGAAG CCAAGGCGCA GCGCGATCTG CTGGAATCCT ACCTGGCGAA ATACCGCGAA GCGACCACCC GAGAGACCAT CGATCAGGCG CCGGCCGACG GCCGTATCAT TTCGCGGGCC ATCGTGTCGA ACACGCCGGC CTATCCGAAG AAGTTGCCGA TCGTGCTGAT CGCCACGCTG GCGACGCTGA TCCTGACCGC CGGCGGCATC GCCACCGGCG AATTGCTGCG AATGACCCAG CCGCGCGCCG CCGGCCTCGC GATTCCGGCG GCCGAGCCGG CGCGGACGCC CGCCGCGATG CAGGCACCGA TGTTCGTCAC GCCTGCCGCA GCAGCTTCGC CGCCGATGGC GCCGGCCCGC GCCGGCACCG GCGAACCCGC GGCCGAACGA GCCGCCGATG CGGACGACAT CGAGGCGTTG GCGCATCGGC TGCGCAGCGG GGGCGAGGCT GCACGCAAGC TGACCGTGCT CGGCACCGGC 
GACACTGCCG ATGTCACGGC GACGGCTTTG AGCCTGGCGC GTCTGTTGTC GCGCGACGCC AGGGTGGTGC TGGTCGATCT GTCGGAATCC TCCGCGATGT TGAAAGCGGC CTCGGCCGAT CCGGCCGCGC CGGGACTCGC GGAACTGATG CAGGGCGAAG CGTCGTTCGG CCAGGTCATC ACCCGCGACC GCAGCACGGC GCTGCATCTC GTCAGTGCGG GACGGCCCGG CTTCGATCGC AACCTGCTAC AATCGCCGCG GCTGGTCGTG GCGCTGAACG CGCTGCTGCG GGTGTACGAT CACGTCCTGC TCGACGCCGG GACCGCCGCC GATCTGCCCG CCGAAATGTT GACGGCGCAG GCGCGGGCCG TGGTGGTGCC GGCTTCCGAC ATGCCGGCGG ACGCCCGTCT CAAGATGGCG GACCAGCTCA GGGCGGTCGG CTTCTCGGAG GCGACGATGG TGCGCGCCGC GGCGCGGCCG TCGGGTCGCA TCGAGCCCGG CGCGCGCACC GTCGCCGCGT AA
|
Protein sequence | MRFAFWRRTK GETTSNEPAS PAHPAAATAS KAERIKAAAK QPVFNKSSPD EPVIVRRTPA PIEPPTNGDI DLRLIGQALA RKKHLIIAPT LLALVLSLAI VNLITPRYKS EARILIDGRE NIFLRPTGER DEQRNAVDPE AVTSQVQLLL SRELALEVIK QNKLAERPEF DPVLKGINPL KSLLAMVGIG RDPFSMTPEE RVLDAYYERL TAYAVDKSRV MVVEFQSQDP ELAARVANSI ADDYLVLQQN ARQAQARSAG QWLSGEIESL RKKVAEAESK AEDFRSKSSL FIGTNNTTLS NQQLGELNTQ LGNARALKSD AESKSRLIKE MLQGGRPIEV SDVLNSDVMR RLSEQRVMLR TQLAEQSSTL LDNHPRIKEL RAQLADLDRQ LRDEAMKLSR SFESDARIAS GRVDSLIASL EQLKKQASST NGQDVELRAL EREAKAQRDL LESYLAKYRE ATTRETIDQA PADGRIISRA IVSNTPAYPK KLPIVLIATL ATLILTAGGI ATGELLRMTQ PRAAGLAIPA AEPARTPAAM QAPMFVTPAA AASPPMAPAR AGTGEPAAER AADADDIEAL AHRLRSGGEA ARKLTVLGTG DTADVTATAL SLARLLSRDA RVVLVDLSES SAMLKAASAD PAAPGLAELM QGEASFGQVI TRDRSTALHL VSAGRPGFDR NLLQSPRLVV ALNALLRVYD HVLLDAGTAA DLPAEMLTAQ ARAVVVPASD MPADARLKMA DQLRAVGFSE ATMVRAAARP SGRIEPGART VAA
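
The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below is a minimal illustration, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is unspliced and ends in a single stop codon (the gene sequence above ends in TAA). The function names `gene_length` and `expected_protein_length` are hypothetical helpers introduced for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for locus tag RPB_2661.
length_bp = gene_length(3043165, 3045486)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)  # 2322 773
```

Under these assumptions the computed values agree with the Gene Length (2322 bp) and Protein Length (773 aa) fields reported in the record.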