Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4471 |
Symbol | |
ID | 6412155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 4808495 |
End bp | 4809556 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642714353 |
Product | glycosyl transferase family 2 |
Protein accession | YP_001993442 |
Protein GI | 192292837 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.700837 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGGATG GCGTGTCGCC GGAGCAGCCC AAAGTCTCGG TGATCATGCC GGTACGCGAC GGCGAACGCT GGGTCGCGGA GGCGATCCGC AGCGTTCAGA GCCAGACGTT CCGCGATTTC GAGCTGCTGG TGATCGACGA TGGCTCCGCC GACGCCACGC CTGCCATTCT GGCCGAGGCG GCCGCGGCCG ACCCACGCAT TTTCGTGCTC ACCCAGCACC GCGACGGTCT GGTAGCTGGC CTCAATCGCG GACTGGCCGC GGCCCGCGCC CCCTTGATCG CCAGGCTGGA CGCCGACGAC ATCGCCCTGC CCGACCGGCT GGCCCGGCAG GTGGATTACA TGACGACGCA TCCCGCGGTC GTTCTACTCG GCGGCTGGGC GACCATTGTC GACGAGAACG GCCAGCCGAA GGGACGCGAC ATGCGCCCGA ACCCGAACGA TCTGCGGGCG ACGCTGATGA AGAAAAGCCC GTTCATTCAT CCGACCGTCA TGTTCCGCGC CGAGGCGGCG CGCCGGGTCG GCGGCTACCA TGCCGCCTTC GAGGCCGGCG AGGATTACGA TTTCTGGCTC CGACTCGCGG ATATCGGCGA GGTCGCAATC CTTCCGGAGC TCCTGATCCG CTACCGCGAG CACAGCGCGA GCGTCACCCG CACACGACAG CTGCGTCAGA TCTACTCCGC CCGCCTCGCC AAGCTCGCTG CCGCCGCGCG CCAAGCCGGT CAGCCCGATC CCTCGGCCGC ACTCATCACC GCTCCGGATT GGCACGATCC ATCCCCCGGC CCGTTCGAGT GCGACAGTTC GCGGCTGTTC CGAATGCTGG AGCTCGCCGA TCCAGCCGTC GCGGCTCGCA TCGCCCCCTC GGCGATCGAT CTAAATGCCA TCACCTCGCA GCTTGGCACC CTGACGGCCG GCGAGCGGAA ATTCGCGCAG GCAGCGCTGC TCAACCTGTT GCGCGCCAAC AAGCTGCCAT CTCTGGCCGC CAGGTGGTCC GCTGTCTTTT TGCTGGTCAG ACTTGGACCG CTCAAGGCGA TTAAGGCGGC ACGGCAGCTT CGAAACCGCT AA
|
Protein sequence | MRDGVSPEQP KVSVIMPVRD GERWVAEAIR SVQSQTFRDF ELLVIDDGSA DATPAILAEA AAADPRIFVL TQHRDGLVAG LNRGLAAARA PLIARLDADD IALPDRLARQ VDYMTTHPAV VLLGGWATIV DENGQPKGRD MRPNPNDLRA TLMKKSPFIH PTVMFRAEAA RRVGGYHAAF EAGEDYDFWL RLADIGEVAI LPELLIRYRE HSASVTRTRQ LRQIYSARLA KLAAAARQAG QPDPSAALIT APDWHDPSPG PFECDSSRLF RMLELADPAV AARIAPSAID LNAITSQLGT LTAGERKFAQ AALLNLLRAN KLPSLAARWS AVFLLVRLGP LKAIKAARQL RNR
|
| |