Gene Rpal_2033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_2033 
Symbol 
ID6409693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp2205842 
End bp2206831 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content69% 
IMG OID642711919 
Productarginine/ornithine transport system ATPase 
Protein accessionYP_001991031 
Protein GI192290426 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1703] Putative periplasmic protein kinase ArgK and related GTPases of G3E family 
TIGRFAM ID[TIGR00750] LAO/AO transport system ATPase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTCAG CCCGACGCAC CGGCGACATT GCCCATTTCG CCAGCGAGGT CCGCAGCGGC 
CATCGCGCCG CGCTGGCGCG CGCCATCACC CTGATCGAAA GCCGCCGAGC CGACCACCAG
GCGTCCGCGC GCGCGCTGGT GCAGGACCTG CTGCCCGAAA CCGGCCGCGC GATCCGCGTC
GGCATCACCG GCTCACCCGG CGTCGGCAAG TCGACCACCA TCGACGCGCT CGGCATGTAT
CTGATCGAGC AGGGCCACAA GGTCGCGGTG CTGGCGGTCG ATCCGTCATC GGCGCGCACC
GGCGGGTCAA TTCTCGGCGA CAAGACCCGG ATGGCGCGGC TATCCGCCGA GCCGGACGCC
TTCATCCGTC CCTCGCCCTC CTCCGGCACG CTTGGCGGCG TCGCGGCGAA GACCCGCGAG
GCGATGCTGC TGTGCGAGGC CGCCGGCTTC GACGTCGTGC TGGTCGAGAC CGTCGGCATC
GGCCAATCCG AAACCGCGGT CTGCGATATG ACCGATTTCT TTCTGGCGCT GATGCTGCCG
GGCGCCGGCG ACGAGTTGCA GGGCATCAAG AAGGGCCTGG TCGAGCTCGC CGACATGATC
GCCATCAACA AGGCGGACGG CGACAACATC AAGCGCGCCA ATCTCGCCGC CGGCGAGTAT
CGCGCGGCGC TGCATATCCT CACCCCGCGT TCGCCGAACT GGCAGCCGCC GGTGAAGACC
TATTCGGCAA TGACCGGCAC CGGCATCGCC GAGCTGTGGC AGGCGATCCT GGACCATCGC
GCCGCCACCA CGCCGTCGGG CGAGTTCGAC GCCCGCCGCC GCGAGCAGCA GGTGAAGTGG
ATGTGGACGC TGCTCGAGGA TCGTTGGAAG GCGAAGCTGC GCAGCGACCC GGCGATCCGC
GCCAAGGTGA AATCGACCGA GGCCGCCGTC GCCGAAGGCT CGATCACGCC CACGCTCGGC
GCCGATCGGG TCGCGGAGCT GATCGGGTGA
 
Protein sequence
MNSARRTGDI AHFASEVRSG HRAALARAIT LIESRRADHQ ASARALVQDL LPETGRAIRV 
GITGSPGVGK STTIDALGMY LIEQGHKVAV LAVDPSSART GGSILGDKTR MARLSAEPDA
FIRPSPSSGT LGGVAAKTRE AMLLCEAAGF DVVLVETVGI GQSETAVCDM TDFFLALMLP
GAGDELQGIK KGLVELADMI AINKADGDNI KRANLAAGEY RAALHILTPR SPNWQPPVKT
YSAMTGTGIA ELWQAILDHR AATTPSGEFD ARRREQQVKW MWTLLEDRWK AKLRSDPAIR
AKVKSTEAAV AEGSITPTLG ADRVAELIG