Gene Information |
Locus tag | Rpal_3152 |
Symbol | |
ID | 6410822 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 3397536 |
End bp | 3398831 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642713030 |
Product | protein of unknown function DUF21 |
Protein accession | YP_001992131 |
Protein GI | 192291526 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCTCGG TGGAATTGGC AATCGTTGTC GCCTTGATCG TCGTCAACGG TCTGCTGTCG ATGTCCGAAC TGGCGATCGT CTCGTCACGC CCGGCGCGGC TGTCGATCCT GGCGCAGCGC GGCGTCCGCG GCGCGCGCCA GGCGATGAAG CTGAGCGAAG ACCCCGGCCG GTTTCTCTCC ACCGTCCAGA TCGGCATCAC CTTGGTCGGC GTGCTGTCCG GCGCGTTCTC CGGCGCGACG CTGGGCCAGC GCCTGAGCGA CTGGCTGACA GCGTCCGGTG TGCCGTTCGC CGACATCATC GGCTTCGGCC TGGTGGTGAC GCTGATCACC TACGCGACAC TGATCGTCGG CGAACTGGTG CCGAAACAGC TGGCGCTGCG CGATCCCGAA GCGGTGGCGG TGAAGGTCGC GCCGGCGATG GCGCTGCTCG CCAAGATCTC GCTGCCAGTC GTGGTCGTGC TCGACATCTC CGGCAAGGCG ATGCTGGCGC TGCTTGGCCA GAGCGGCGAA CCTGAGGACA AAATCTCCGA AGAAGAAATC CATAGCCTGG TGATGGAAGC CGAGACCGCC GGCATACTCG AGCCTGGTGA GCGCCAGATG ATTGCAGGCG TGATGCGGCT CGGCGACCGC CCGGTCGGCG CGGTGATGAC GCCGCGTCCC GAGGTCGACA TGATCGACCT GTCCGATCCG CCCGACCAGA TCCGCGCCAC TTTCGCGAGC AGCCCGCATT CGCGGTTGCC GGCCACGGAT GGAGATCGCG ACGATCCGAT CGGCATTATC CAATCCAAGG ACGTGCTCGA AGTCTATCTG CGCGGGGAGA CGCCGGACTT CCGGGCGCTG GTGCGCGACG CGCCGGTGAT CCCGGCCTCC GCCGACGCGC GCGACGCACT AATCATGCTG CGCAACGCCT CGGTCCATAT GGGGCTGGTG TACGACGAAT TCGGTGGCTT CGAAGGCGTG GTCAGCACCG CCGATATTCT GGAGTCGATC GTCGGCGCGT TCAGCTCCGA AGACGGGCCG CCGGAGCCCG CCGCAGTGCG CCGCGACGAC GGCTCGTACC TTGTCGCGGG GTGGATGCCG GTCGACGAGT TCGGCGACCT GCTGGGCATG CCGGTGCCGG CGCAGCGCGA TTATCACACC GTCGCCGGTC TGGTGCTGTC GCATCTCGGC GCGCTGCCGA GCGTCGGCGA CAAGTTCGAC TTTCAGGACT GGCGGTTCGA GATCATGGAC CTTGATCACC GGCGGATCGA CAAGATCCTG GCGAGCCGCC TGCCGGATGA CGAAGCCTCG CCATGA
|
Protein sequence | MLSVELAIVV ALIVVNGLLS MSELAIVSSR PARLSILAQR GVRGARQAMK LSEDPGRFLS TVQIGITLVG VLSGAFSGAT LGQRLSDWLT ASGVPFADII GFGLVVTLIT YATLIVGELV PKQLALRDPE AVAVKVAPAM ALLAKISLPV VVVLDISGKA MLALLGQSGE PEDKISEEEI HSLVMEAETA GILEPGERQM IAGVMRLGDR PVGAVMTPRP EVDMIDLSDP PDQIRATFAS SPHSRLPATD GDRDDPIGII QSKDVLEVYL RGETPDFRAL VRDAPVIPAS ADARDALIML RNASVHMGLV YDEFGGFEGV VSTADILESI VGAFSSEDGP PEPAAVRRDD GSYLVAGWMP VDEFGDLLGM PVPAQRDYHT VAGLVLSHLG ALPSVGDKFD FQDWRFEIMD LDHRRIDKIL ASRLPDDEAS P
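The length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the source database: the function names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical, and it assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon (the listed gene sequence ends in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


def gc_percent(sequence: str) -> float:
    """GC content in percent; spaces used to group the sequence are ignored."""
    bases = sequence.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above (Start bp, End bp).
length = gene_length(3397536, 3398831)
print(length)                  # 1296, matching "Gene Length"
print(protein_length(length))  # 431, matching "Protein Length"
```

`gc_percent` can be applied to the "Gene sequence" field as printed, since it strips the spaces that separate the 10-base groups.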