Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_0855 |
Symbol | |
ID | 3969813 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 943840 |
End bp | 944820 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637923971 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_530744 |
Protein GI | 90422374 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.584451 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGGAT TGACACGACG CGATTTTGCG TTTGGATCGG CAGCGGCGGC CGCGCTACTC GCCGCGGGTC GGCCGGCCTT GGCTGCCGAC GTCGCGGTGC GTTGGGCTTC GCTGCAGCCC GGCTTCACCG TGCTGCCGGT GCAATACATC CTCGCCAACA AGCTCGGTCA AAAGCACGGG CTGACGCTGC CCGATCCCGC GCCCTACACC GCGGTCTCGA CCTACTACAA TGATTTTGTC GCCGGCAATT ACGATGTCTG CATCGGCAGC TGGGACACTT TCGCCGCGCG GCACCAGGCC GGCGTGCCGA TCAAATATCT CTGCACCATC ACCGACGCCA ATATGATTGC GCTGCTGGCG CCGAAGTCCG GCGTCGCCGA CGTGACGCAA CTCCGCGGCA AGACCATCGC GGCGCTGCAA TCGACCGGCA CCTATCGGAT GGTGCGGGCG CTGATCAAGG AAGGCAGCGG CCTCGATATC GAGAAGGACG CCACCATCCA GAACGTCGAC AATCCGGCGG CCTCGGTCAC GCTGGTGATG GCGGACCGCG CCGACGCAGC CTTGTCGTGG GAGCCGAACA TCACCACCGG CCTCGTGAAG AAGCCGGACC TGCGGGTGAT CTTCAAAGCC GGCGACGCCT ATCACAAGAT CGGCGAGGGC GACCTGCCGT ATTTCGGCGT CGGCATCCGC CAGGAGCTCT TGGACAAGAA CCCCGGCATC GCCGCCAAGA TCGCCGCCGT GTTCGAGGAT TGCCTCAAGG GCATCAACGC CGACACCGCC AAGGCGGTCG ACCTGTTCGG CGCCAAGACC GGGGTGGCCA ACGACATCTT GAAAGAGGCG ATGGGCTCGA AGCGGCTGAC CTTCAATTTC CGGCCGATGT CCGACCCGGC GTCGCGCAAA TCGGTGCTGA AGGCCAGCGA GTTCTTGGCT CGCAACGGGC TTCTGACCAA GCCGGTCGAC GACAGCTTCT TCGCGATCTG A
|
Protein sequence | MNGLTRRDFA FGSAAAAALL AAGRPALAAD VAVRWASLQP GFTVLPVQYI LANKLGQKHG LTLPDPAPYT AVSTYYNDFV AGNYDVCIGS WDTFAARHQA GVPIKYLCTI TDANMIALLA PKSGVADVTQ LRGKTIAALQ STGTYRMVRA LIKEGSGLDI EKDATIQNVD NPAASVTLVM ADRADAALSW EPNITTGLVK KPDLRVIFKA GDAYHKIGEG DLPYFGVGIR QELLDKNPGI AAKIAAVFED CLKGINADTA KAVDLFGAKT GVANDILKEA MGSKRLTFNF RPMSDPASRK SVLKASEFLA RNGLLTKPVD DSFFAI
|
| |