Gene Information |
Locus tag | RPB_3820 |
Symbol | |
ID | 3911623 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4365335 |
End bp | 4366828 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637885721 |
Product | hypothetical protein |
Protein accession | YP_487425 |
Protein GI | 86750929 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
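
The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal check, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming a terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length_bp = gene_length(4365335, 4366828)
print(length_bp, protein_length(length_bp))  # prints: 1494 497
```

Both results match the Gene Length (1494 bp) and Protein Length (497 aa) fields. The strand field does not affect the length calculation, only the direction in which the sequence is read.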
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.499041 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCGAG TTTTGTCCCG GTGGGGGTTT TCCATCGGTC TGTCGGCGTT GGTCGTGGCC GCGCTGGTGT TGCCGGTGGT GCTCGCTGCG TTGGTGTTTC CCACACCGCT CTACGACACC CGCGAACTGG TCGCCTGGGG CCGGCATTTT CCGCTGCTGA CGCCTGTGCA TCCGCCGATG ATGGTGTGGG CCGGCGGCGC GGTGGATCGC CTGTTCGGGC CGTCCGGCAC GGCGATCGTG TTCGTCAACC AGATCTGCCT CGCGATCGGG CTCGGCTATC TCTACGCCAC GCTACGGCTG CTGGTCGACC GGGCGATGGC GGCCTACGTC CTGGCGCTCG CCGCGGCGTC GTTCTACGTC GTGTTCGCGC CGCTGTCGTG GGCGCTCAAT GCCGACATCC TGCAGCTCAT GTCCTGGCCG GCGGTGCTGT ATCACTTCCT GCGCGGGCGC AGCACCGATC GCTGGCTGCA TTGGATCCTG CTCGGCATCT GGGCGGCGAT CGCGGCGCTG ACGAAGTACA ACGCCGCCGT GCTGTTCCTG GCGATGGCGG CGGGCATCGT CGCGGTGCCG TCGGTTCGCG CCTGCCTGCG CCGGCCGGGG CCCTATGTGG CAGTGCTGGT CGGCGCGCTG TTGTTCCTGC CGCACGCGAT CGCGGCCTGG CGCTACGGCA CCACCATCGC CTATGGCGAG CGGCATTTCA CCGGCTTCGG TTCGGTTGCG GACACCGCGC GGCGGCTCGG CCTGCTGGTC GCCGGCTATC TGCCGCTGCT GCTGCCCGGC GCGATCGTGC TGGCGATCGC GGTCGGGCGG CGGATGATCG CGTGGCGGGT GCCGCGGTTT GCCGAGGCCA GTGACGAACT CAAGTTCATC GTCATCGTCA ATGTCGCGAT GTTCGCCGTG CTGGTGGTGC TGATCGCCGG CTGCGGTCTG GAATACATCG CCCGCTACGG CGCGCCATTC GCCGAACTCG CGGTGCTGGC GCTGGCGCCG CTGTTCACCT GGAACGAGTC GCGGCGCGCC GTGGCGGTGC GCCAGACCGT GCAGTCGCTC GGCGTGCTGT ATGCTGTGCT CGCCGCCGGG GCTTCGGTCG TCTATCTGTT GTTCGCCTCG CATAGCGGGC TGCAGGAGCC GACCGCGCAG GCGGCGCGCG TCATCCTTGC CGACTGGAAC AGCAAGTACA AATGCGGTCC CGGTTACTAT CTGGGCGACC GCCAGACCGT CTACGGCATC GGCATCGCCG CCGGCCCCGA CGGCGATTCG ATGACCATCA ACTTCATTCC GAAGGCGCGC TGGTTCGATA CGGCGAAACT CGAAGCGAAC GGCGCGGTGC TGGTCTACAC GCTGCCGCAG GTCCCCGGCG ATTTCGCCAC GGCGTTTCCC GGTGCGACGA TGTCCGAGGA AAAGCGCATC AGCGTGCCGG TGCTGCGCAC CCACACCGGC AAAACCAAGG ACTACTTCTA TCGCTTCGTC GCGCCGAAGG CGTGCACGGG CTGA
|
Protein sequence | MDRVLSRWGF SIGLSALVVA ALVLPVVLAA LVFPTPLYDT RELVAWGRHF PLLTPVHPPM MVWAGGAVDR LFGPSGTAIV FVNQICLAIG LGYLYATLRL LVDRAMAAYV LALAAASFYV VFAPLSWALN ADILQLMSWP AVLYHFLRGR STDRWLHWIL LGIWAAIAAL TKYNAAVLFL AMAAGIVAVP SVRACLRRPG PYVAVLVGAL LFLPHAIAAW RYGTTIAYGE RHFTGFGSVA DTARRLGLLV AGYLPLLLPG AIVLAIAVGR RMIAWRVPRF AEASDELKFI VIVNVAMFAV LVVLIAGCGL EYIARYGAPF AELAVLALAP LFTWNESRRA VAVRQTVQSL GVLYAVLAAG ASVVYLLFAS HSGLQEPTAQ AARVILADWN SKYKCGPGYY LGDRQTVYGI GIAAGPDGDS MTINFIPKAR WFDTAKLEAN GAVLVYTLPQ VPGDFATAFP GATMSEEKRI SVPVLRTHTG KTKDYFYRFV APKACTG
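
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any further processing. The sketch below shows one way to do that and to compute a GC fraction, assuming the usual definition (G plus C bases divided by total length). The record does not state how its GC content field was calculated. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample holds only the first three blocks of the gene sequence, so its result will not reproduce the 68% reported for the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence, copied from the record above.
sample = "ATGGATCGAG TTTTGTCCCG GTGGGGGTTT"
seq = clean_sequence(sample)
print(len(seq), round(gc_fraction(seq), 3))  # prints: 30 0.533
```

Passing the complete Gene sequence field through the same two functions gives a value that can be compared against the GC content field.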