Gene Rpal_3174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_3174 
Symbol 
ID6410844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp3419185 
End bp3420174 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content66% 
IMG OID642713052 
Productprotein of unknown function DUF815 
Protein accessionYP_001992153 
Protein GI192291548 
COG category[R] General function prediction only 
COG ID[COG2607] Predicted ATPase (AAA+ superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.535414 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAAAA AGACAAAATC CCGCCCCGCC AAAGCCGCCC CGAAGTCCGC CGCCCGCAAG 
CCCGCCCGCG CACCGGCGGC AAAGCGCCGC CCGCCGGCGA CGCCTCCCGG CGCATCCCTC
GAGGGCGCAC TGCTGGAACG GATCGCCCAC GCGCTGGAGG GCATTTCCGC CCACCTGGCG
GGCACTTCCG CCGCTCCGGC CGATGCCGCG CTGAATTCGG CCGACGCATT CATCTGGCAG
CCCGAAGGGC GGCTCGCGCC GGTTCCAAAG GTCAGCCGGG TCGATCTGTC GCTGCTGCAG
GGCGTCGACC GGATGCGCGA CACGCTGATC GAAAATACCG AGCGGTTCGC GACCGGCCTG
CCCGCCAACA ACGCATTGTT GTGGGGCGCG CGGGGGATGG GCAAATCGTC GCTGGTCAAA
GCCGCTCACG CGCATGTCAA CGCGCGGCCC GACGTCGCCG GCCGGCTGAA GCTGATCGAG
ATTCACCGCG AAGACATCGA GAGCCTGCCG GCGCTGATGA CGCTGCTGCG CGCTTCCGAC
CTCCGATTCA TCGTGTTCTG CGACGATCTG TCGTTCGACG GCAACGACGC CTCGTACAAA
TCGCTCAAGG CCGTGCTCGA AGGCGGCATT GAGGGCCGCC CCGACAACGT AATTCTTTAC
GCGACCTCGA ACCGGCGGCA TCTGCTGCCG CGCGACATGA TGGAGAACGA GCGCTCGACC
GCGATCAATC CCGGCGAAGC GGTCGAAGAG AAGGTGTCGC TGTCCGATCG GTTCGGACTG
TGGCTCGGCT TCCACAAATG CAGCCAGGAT GAATTCCTGG TGATGGTGCG CGGTTACTGC
GCACACTACG ACATCGCGAT CGACGACGAA CAGCTCGAGC GCGAAGCTCT GGAATGGTCG
ACCACGCGCG GCTCGCGCTC CGGCCGCGTC GCCTGGCAGT TCGTGCAGGA TCTCGCCGGC
CGCCTCAAGG TGCGACTCGG AACCAAGTAG
 
Protein sequence
MAKKTKSRPA KAAPKSAARK PARAPAAKRR PPATPPGASL EGALLERIAH ALEGISAHLA 
GTSAAPADAA LNSADAFIWQ PEGRLAPVPK VSRVDLSLLQ GVDRMRDTLI ENTERFATGL
PANNALLWGA RGMGKSSLVK AAHAHVNARP DVAGRLKLIE IHREDIESLP ALMTLLRASD
LRFIVFCDDL SFDGNDASYK SLKAVLEGGI EGRPDNVILY ATSNRRHLLP RDMMENERST
AINPGEAVEE KVSLSDRFGL WLGFHKCSQD EFLVMVRGYC AHYDIAIDDE QLEREALEWS
TTRGSRSGRV AWQFVQDLAG RLKVRLGTK