Gene Rpal_4323 details

Gene Information

Locus tag: Rpal_4323
Symbol:
ID: 6412007
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4650612
End bp: 4651829
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 67%
IMG OID: 642714205
Product: VWA containing CoxE family protein
Protein accession: YP_001993294
Protein GI: 192292689
COG category: [R] General function prediction only
COG ID: [COG3552] Protein containing von Willebrand factor type A (vWA) domain
TIGRFAM ID:
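
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4650612
end_bp = 4651829

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1218
print(protein_length)  # 405
```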


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGCGTA ATCCGACCGC GATGATCGAC CATCTCAATC CGCCAACCGG CAAGATGGCG 
GACAATGTGG TCGGCTTTGC GCGGGCGCTG CGCGCGGCGG GGCTGCCGGT CGGACCGGGC
GCGGTGATCG ATGCGCTGGA CGCGCTGCAG CTGATCGAGA TCGGCCATCG CGACGATCTC
TACGCCACGC TGGAAGCGAT CTTCGTCAAG CGCCGCGAAC ATCTGTTGAT CTTCGACCAG
GCGTTCGCGC TGTTCTTCCG CGCCGCCGAG GATTGGCAGC ACATGCTGGA CTCGATCCCG
CTGCCGGACG CCGCCAAGAA AAAGCCGCCG CCGGCCTCGC GCCGGGTGCA GGAAGCGATG
TCGCCGGCGG CGACGCGCGA CATGCCGTCG GCCGAGGAGC AGGAATTGCG GCTCGCCGTC
TCCGACAAGG AAATCCTGCA GAAGAAGGAC TTCGCGCAGA TGAGCGCGGC GGAGATCGCC
GAGGTGACCC GCGCGATCGA ACGGATGAGG TTGCCGCAGG CCGAGCTGCG CACCCGGAGA
GTGAGGCCCG ATCGTCGCGG CCTGAAGCTC GACCTGCGCC GCACCTTGCG CGCGTCGTTG
CGGACCGGCG GCGAGGTCGT CGATATCAAG CGGCTCGGTC TGATCGACAA GCCAGCGCCG
ATCGTCGCGC TGCTCGATAT CTCCGGGTCG ATGAGCGAAT ACACGCGGCT GTTCCTGCAC
TTCCTCCACG CCATCACCGA TGACCGCAAG CGGGTGTCGA CCTTCCTGTT CGGCACAAGG
CTGACCAACG TCACCCGTGC GCTGCGGCAG CGCGATCCGG ACGAAGCGCT GGCGAGCTGC
ACCTCCTCGG TCGAGGACTG GGCCGGCGGC ACGCGGATCG CCACCTCGCT GCACAGCTTC
AACAAGCTGT GGGCGCGGCG GGTGCTCGGC CAAGGTGCGA TCGTGCTGCT GATCTCCGAC
GGGCTGGAGC GCGAGAGCGA CTCCAAGCTG GCGTTCGAGA TGGACCGGCT GCATCGCTCC
TGCCGCCGGC TGATCTGGCT CAATCCGCTG CTGCGCTACG ACGGTTTCGA GCCGCGCGCC
CAGGGCATCA AAATGATGCT ACCCCACGTT GACGAATTCC GCCCGGTGCA TAATTTGACC
TCGATGCACA CGCTGATCGC GGCGCTGTCG TCGGCACCGC CGCCGCACCA TTTCAGCACG
ATCCGTTCCG TCGCCTGA
 
Protein sequence
MQRNPTAMID HLNPPTGKMA DNVVGFARAL RAAGLPVGPG AVIDALDALQ LIEIGHRDDL 
YATLEAIFVK RREHLLIFDQ AFALFFRAAE DWQHMLDSIP LPDAAKKKPP PASRRVQEAM
SPAATRDMPS AEEQELRLAV SDKEILQKKD FAQMSAAEIA EVTRAIERMR LPQAELRTRR
VRPDRRGLKL DLRRTLRASL RTGGEVVDIK RLGLIDKPAP IVALLDISGS MSEYTRLFLH
FLHAITDDRK RVSTFLFGTR LTNVTRALRQ RDPDEALASC TSSVEDWAGG TRIATSLHSF
NKLWARRVLG QGAIVLLISD GLERESDSKL AFEMDRLHRS CRRLIWLNPL LRYDGFEPRA
QGIKMMLPHV DEFRPVHNLT SMHTLIAALS SAPPPHHFST IRSVA
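
Both sequences are printed in space-separated groups of 10 characters, so they need to be joined before any length or composition check. The following is a minimal sketch of how that could be done; the helper names (`strip_grouping`, `gc_percent`, `has_start_and_stop`) are hypothetical and not part of any database tool. The stop codons are those of translation table 11; the start-codon set lists only the commonly used bacterial starts, although table 11 permits a few more.

```python
# Stop codons of translation table 11 (bacterial code).
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Assumption: only the common bacterial start codons are checked here.
START_CODONS = {"ATG", "GTG", "TTG"}


def strip_grouping(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    seq = strip_grouping(dna)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def has_start_and_stop(dna: str) -> bool:
    """Check that the length is a multiple of 3 and the ends are start/stop codons."""
    seq = strip_grouping(dna)
    return (
        len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )
```

Applied to the full text under "Gene sequence", `len(strip_grouping(...))` can be compared with the Gene Length field, `gc_percent(...)` with the GC content field, and `has_start_and_stop(...)` confirms the ATG start and TGA stop visible in the first and last lines. Applying `strip_grouping` to the text under "Protein sequence" gives a string whose length can be compared with the Protein Length field.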