Gene RPB_1191 details


Gene Information

Locus tag: RPB_1191
Symbol:
ID: 3910126
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1367463
End bp: 1368401
Gene Length: 939 bp
Protein Length: 312 aa
Translation table: 11
GC content: 69%
IMG OID: 637883085
Product: hypothetical protein
Protein accession: YP_484812
Protein GI: 86748316
COG category: [R] General function prediction only
COG ID: [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain)
TIGRFAM ID:
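
The length fields above follow from the coordinates by simple arithmetic. The sketch below shows that check, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(1367463, 1368401)
print(length_bp)                           # 939
print(expected_protein_length(length_bp))  # 312
```

Both values agree with the Gene Length and Protein Length fields of this record.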


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.503555
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.284439
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCAGG CGCCCGAGCA GCCGAACAAG GAGACGCTGG CGGTTCGACG TGCTGATGGC 
GAGAGCCGGA CGCTCGCCGC ATCGCTGCCG CGCCTGATGC TCGAAGCCCG CCGCATCGCC
AACAACGTCA CCCACGGCCT GCACGGCCGC CGCCGCGCCG GCGCCGGCGA AAACTTCTGG
CAATATCGCC GCTTCGTCTC CGGCGAACCG GCCACCCGCG TCGACTGGCG CCGTTCGGCG
CGCGACGATC ATCTCTACGT CCGCGAGCTG GAATGGGAGG CCGCGCACAC CGTGTGGCTG
TGGCCGGACC GTTCCGCCTC GATGGCCTAC GCCTCGAAGG GCGTGCGCGA CAGCAAGCTC
GAGCGCGCCC TGATCGTGAC TTTCGCGCTG GCCGAATTGC TGGTCGCGGG CGGCGAACGC
GTCGGCATTC CCGGGCTGAT GAACCCGACC TCGAACAACA ATGTGATCGA CCGGATGGCG
CAGGCGATCC TGCACGACAC AACCTCGCGT GACAGCCTGC CGCCGTCCTT CGTGCCGTCG
TCGCTGGCCG AGATCGTGGT GCTGTCCGAC TTCTGGTCAC CGCTCGGCGA GATCCGGCAG
ATGCTTTCGG GCCTGTCCTC GTCCGGCGCG CACGGCTCGC TGGTGCAGGT CGTCGATCCT
GCGGAAGAGA GCTTTCCGTT CTCCGGACGC ATCGAATTCG TCGAGCCGGA AGGCGGCGGT
GCGATCACCG CGGGCCGCGC CGAGACCTGG GCGGCGGACT ACGTCGCGCT GGTCGCGGCG
CATCGCGATC AGATCCGAGT CGAGACCGGC ACGCTCGACT GGCTGTTTTC GACGCACACC
ACCAGCCGTT CGGCTGCCGA GCTGTTGTTG TTCCTGCACG CCGGGATGAC CACGGCCAAG
GGCGCCGAGC GCGGCACCAA AACGGGACTC GGCGCATGA
 
Protein sequence
MAQAPEQPNK ETLAVRRADG ESRTLAASLP RLMLEARRIA NNVTHGLHGR RRAGAGENFW 
QYRRFVSGEP ATRVDWRRSA RDDHLYVREL EWEAAHTVWL WPDRSASMAY ASKGVRDSKL
ERALIVTFAL AELLVAGGER VGIPGLMNPT SNNNVIDRMA QAILHDTTSR DSLPPSFVPS
SLAEIVVLSD FWSPLGEIRQ MLSGLSSSGA HGSLVQVVDP AEESFPFSGR IEFVEPEGGG
AITAGRAETW AADYVALVAA HRDQIRVETG TLDWLFSTHT TSRSAAELLL FLHAGMTTAK
GAERGTKTGL GA
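
The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not the database's own method: it strips the 10-base spacing, computes the GC fraction, and translates codons until the first stop. The helper names are hypothetical. The codon string is the standard amino acid assignment in TCAG order, which translation table 11 shares (table 11 differs only in which codons may act as starts).

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        a, b, c = (BASES.index(base) for base in seq[i:i + 3])
        residue = AMINO_ACIDS[16 * a + 4 * b + c]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence listed above.
print(translate("ATGGCGCAGG CGCCCGAGCA GCCGAACAAG"))  # MAQAPEQPNK
print(translate("GGCGCATGA"))                         # GA
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_fraction` on the same input can be compared with the GC content field.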