Gene RPC_0855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_0855 
Symbol 
ID3969813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp943840 
End bp944820 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content64% 
IMG OID637923971 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_530744 
Protein GI90422374 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.584451 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGGAT TGACACGACG CGATTTTGCG TTTGGATCGG CAGCGGCGGC CGCGCTACTC 
GCCGCGGGTC GGCCGGCCTT GGCTGCCGAC GTCGCGGTGC GTTGGGCTTC GCTGCAGCCC
GGCTTCACCG TGCTGCCGGT GCAATACATC CTCGCCAACA AGCTCGGTCA AAAGCACGGG
CTGACGCTGC CCGATCCCGC GCCCTACACC GCGGTCTCGA CCTACTACAA TGATTTTGTC
GCCGGCAATT ACGATGTCTG CATCGGCAGC TGGGACACTT TCGCCGCGCG GCACCAGGCC
GGCGTGCCGA TCAAATATCT CTGCACCATC ACCGACGCCA ATATGATTGC GCTGCTGGCG
CCGAAGTCCG GCGTCGCCGA CGTGACGCAA CTCCGCGGCA AGACCATCGC GGCGCTGCAA
TCGACCGGCA CCTATCGGAT GGTGCGGGCG CTGATCAAGG AAGGCAGCGG CCTCGATATC
GAGAAGGACG CCACCATCCA GAACGTCGAC AATCCGGCGG CCTCGGTCAC GCTGGTGATG
GCGGACCGCG CCGACGCAGC CTTGTCGTGG GAGCCGAACA TCACCACCGG CCTCGTGAAG
AAGCCGGACC TGCGGGTGAT CTTCAAAGCC GGCGACGCCT ATCACAAGAT CGGCGAGGGC
GACCTGCCGT ATTTCGGCGT CGGCATCCGC CAGGAGCTCT TGGACAAGAA CCCCGGCATC
GCCGCCAAGA TCGCCGCCGT GTTCGAGGAT TGCCTCAAGG GCATCAACGC CGACACCGCC
AAGGCGGTCG ACCTGTTCGG CGCCAAGACC GGGGTGGCCA ACGACATCTT GAAAGAGGCG
ATGGGCTCGA AGCGGCTGAC CTTCAATTTC CGGCCGATGT CCGACCCGGC GTCGCGCAAA
TCGGTGCTGA AGGCCAGCGA GTTCTTGGCT CGCAACGGGC TTCTGACCAA GCCGGTCGAC
GACAGCTTCT TCGCGATCTG A
 
Protein sequence
MNGLTRRDFA FGSAAAAALL AAGRPALAAD VAVRWASLQP GFTVLPVQYI LANKLGQKHG 
LTLPDPAPYT AVSTYYNDFV AGNYDVCIGS WDTFAARHQA GVPIKYLCTI TDANMIALLA
PKSGVADVTQ LRGKTIAALQ STGTYRMVRA LIKEGSGLDI EKDATIQNVD NPAASVTLVM
ADRADAALSW EPNITTGLVK KPDLRVIFKA GDAYHKIGEG DLPYFGVGIR QELLDKNPGI
AAKIAAVFED CLKGINADTA KAVDLFGAKT GVANDILKEA MGSKRLTFNF RPMSDPASRK
SVLKASEFLA RNGLLTKPVD DSFFAI