Gene RPC_3408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3408 
Symbol 
ID3970452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp3793770 
End bp3795260 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content72% 
IMG OID637926519 
Producthypothetical protein 
Protein accessionYP_533267 
Protein GI90424897 
COG category[S] Function unknown 
COG ID[COG1376] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0116345 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.112401 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCGCT ACAAGACGCC GGCGATTGTG GCGCTGGCGG CGCTGCTGGC GTTCGCGGCG 
CTCACCGGGG ACGCCGCCGC CAAGCGGGCC CGCGCGCCGG CGCCGACCGA GACCGCGGCG
CCGCGCCCGG CCGGCGAGCC GATCATGGCG ATCGTGTCGA TCTCCGCGCA GAAGGTCACG
GTCTACGACG CCGACGGCTG GATCTTGCGG GCGCCGGTCT CCACCGGCAC CACCGGGCGC
GAGACCCCGG CCGGGGTGTT CGCGGTGGTC GAGAAGGACA AGGACCACCA TTCGACGATG
TATGACGACG CCTGGATGCC GAACATGCAG CGCATCACCT GGAACGGCAT CGCGCTGCAC
GGCGGCCCGC TGCCGGGCTA CGCCGCCTCG CACGGCTGCG TGCGGATGCC GTATGATTTT
GCCGAAAAAC TGTTCGACAA GACCAATATC GGGATGCGGG TGATCGTCGC GCCGAACGAC
GCAGCGCCGG TGTCGTTCGC CCACCCGGCC CTGTTCACGC CGAAGGCCGA GGCGCTGGCG
ACGGCGCCGG CGCGCGCCGA GATGCTGAGC CGCGAGGCCG CGGAGGCCAG CGCCACCGCC
GAGGCGGCCA AGAAGGCCAA CGCCGCGGCA AGCCGCGACG CATCCGCGCT CGCCGCGGCG
CTGCGCAAGC TGGAGAAGGC CAAAGCGCGC GCCGATGCCC AGCTGAAGGC GGCCGACAAG
GCGCTCGCCG CCGCGACCGA GCCGAACAGG CCCCGACTGG ACGAGCGGCA GCAGATCGCC
GCGCAAAACG CCGCGGACGC CGCGGCGCAG CTCGACGCCG CCCGCGCCGA CGCCGAGACC
AAGCGCGCCG CCGCCTTGGC GGCCAAGGAC GCCGCGAAGT CCGCCGCCGC CGCCAAGGCC
ACCGCCGTGA CGGCGGCCAA CGAGGCCAAG CTCGCGCTGG AGCCGGTGTC GATCTACATC
AGCCGCGCGA CGCAGACGCT GTACGTCCGC CGCAACACCC ATAAGCCGTG GCCGGACGGC
GGCGAGGTGT TCGACGCCAG CATCGAAATT CCGATCAGCA TCCGCGATCC CGATCGGCCG
ATCGGCACCC ACGTGTTCAC CGCGATGGCG CGTGACGAGA GCGGGCTGCG CTGGAGCGCG
GTGACGATCG ATCATGGCGA CGACGCCAAA GCCGCGCTCG ACCGCATCAG CTTTCCGCAG
GACGAGCTTG CGCGGATCGG CGTCACCGCG ATGCCGCGGT CCTCGATCGT GGTCTCGGAC
GAGCCGCTGA GCAAAGAGAC CAACTATCGC ACCGAATTCG TCGCGGTGCT GAGCAACCAG
CCGCAGGGCG GCTTCATCAC CCGCAAGCCG ACCGTGCCTG CGCCTGCGCC TGCGATGGCC
GAGCGCGACG ACGGCGACGA TTTCTTCAGC TTCTTCCAGC GCAACCAGGG CCCCGCCGTG
CCGCAACGCC GCGGCCCGGG CTTTGCCCCC GGCCCGCGCG GCTGGTGGTA G
 
Protein sequence
MQRYKTPAIV ALAALLAFAA LTGDAAAKRA RAPAPTETAA PRPAGEPIMA IVSISAQKVT 
VYDADGWILR APVSTGTTGR ETPAGVFAVV EKDKDHHSTM YDDAWMPNMQ RITWNGIALH
GGPLPGYAAS HGCVRMPYDF AEKLFDKTNI GMRVIVAPND AAPVSFAHPA LFTPKAEALA
TAPARAEMLS REAAEASATA EAAKKANAAA SRDASALAAA LRKLEKAKAR ADAQLKAADK
ALAAATEPNR PRLDERQQIA AQNAADAAAQ LDAARADAET KRAAALAAKD AAKSAAAAKA
TAVTAANEAK LALEPVSIYI SRATQTLYVR RNTHKPWPDG GEVFDASIEI PISIRDPDRP
IGTHVFTAMA RDESGLRWSA VTIDHGDDAK AALDRISFPQ DELARIGVTA MPRSSIVVSD
EPLSKETNYR TEFVAVLSNQ PQGGFITRKP TVPAPAPAMA ERDDGDDFFS FFQRNQGPAV
PQRRGPGFAP GPRGWW