Gene Rpal_4213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4213 
Symbol 
ID6411897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4517121 
End bp4518986 
Gene Length1866 bp 
Protein Length621 aa 
Translation table11 
GC content64% 
IMG OID642714095 
Productextracellular solute-binding protein family 5 
Protein accessionYP_001993184 
Protein GI192292579 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.238618 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCAGC TTAACCGCCG CAACGTGCTC GCTCTCGGCG TTGGCGCGCT GGCTGCCACG 
CATCTCCGGG GCACCGCCGC CGCGGCCGAA GGCGAGACGA TCGCCCACGG CATGTCGGCT
TTCGGTGACC TGAAGTACCC GGCCGACTTC GCGCATTTCG ACTATGTCGA TCCGCGAACT
CCAAAAGGCG GACTGTTCTC CACGATCCCG TCGGTTCGGG CGTTCAACCA GTCGTTTCAG
ACGTTCAACT CGCTCAACGC CTATATTCTC AAAGGGGACG GCGCCCAGGG CATGGGGCTG
ACCTTCGCGA CGCTGATGGC GCGTGCCGGC GACGAGCCCG ACGCGATGTA CGGCTTCGCG
GCCTCCAAAG TGGCGATCTC TGCCGACGGC CTGGCCTATC GCTTCACGAT GCGGCCGGAA
GCACGTTTCC ACGACGGCAG CAAACTGACG GCGCGCGACG CCGCTTTCTC GCTGAACATC
TTGAAGGCGA AGGGCCACCC GATCGTCACG CAACAAATGC GCGACTTCAT CGAGGCTGTG
GCGACCGATG ACGCGACGCT GGTGGTGACC TTCAAGCCGA AGCGCGGCCG CGACGTGCCG
CTGTTCGTCG CCGGCCTGCC GCTGTTCTCG GAAACTTACT ATTCGAAACA GCCGTTCGAT
GAATCCACCA TGGATGTGCC GCTCGGGAGC GGGCCCTACA AGGTCGGACG GCTCGAATCC
GGTCGCTACA TCGAGTTCGA TCGGGTCAAG GATTGGTGGG GCGCGAAGCT GCCGGTGAAT
GTCGGGGCTT ACAATTTCGA CATCGTTCGG TTCGAGTTCT ATCGCGATCG CGACGTTGCG
TTCGAAGGCT TCACCGGGCG CAGCTATCTG TTTCGCGAGG AGTTCACCTC GCGGATCTGG
AACACCCGCT ACGATTTCCC CGCGATCCAT GACGGCCGCG TCAAGCGCGA GATCCTGCCG
GACGACACCC CGTCGGGCGC ACAGGGCTGG TTCATCAACA CCCGCCGCGA CAAGTTCAAG
GATCCGCGCG TCCGCGAGGC GCTCGGCTGC GCGTTCGATT TCGAGTGGAC CAACAAGACC
ATCATGTACG GCACCTATGC GCGCACGGTG TCGCCATTCC AGAATTCCGA CATGATGGCG
GTGGGCGCGC CGTCGCCCGA AGAGTTGGCG CTGCTCGAAC CGTTCCGCGG CAAGGTGCCC
GACGAAGTGT TCGGGACACC GTTCATACCG CCCGCATCTG ACGGCTCTGG ACAGGACCGG
GCGCTGCTGC GCCGGGGCGG GCAGCTGTTG AACGAGGCTG GCTTTCCGAT CAAGAACGGC
AAACGTCTGA CGCCTCAGGG GGAGCCGTTC CGGGTCGAAT TCCTGCTCGA AGAGCCGGCA
TTCCAGCCGC ACCATATGCC GTTCATCAAG AACCTCGGCA CGCTCGGCAT CGACGCCACG
TTGAGGCTGG TCGATCCGGT GCAACTGCGG GCGCGCCGTG ACGATTTTGA TTTCGATCTG
ACGATCGAGC GCTACAGCTT TTCGACCGTG CCGGGCGACG CGCTGCGCAA CTTCTTCTCG
TCGCAGGCGG CAGCCACCAA GGGCTCGAAC AATCTCGCCG GCATTTCCGA TCCGGCCATC
GACGCGATGA TCGATCAGGT GATCGCGGCC GACACCCGCA CCAAACTGGT TGTTGCGGCG
CGCGCGCTTG ATCGACTGAT CCGGGCTGGC CGTTATTGGG TGCCGCAATG GTACTCGGCC
TCGCACCGGC TGGCCTATTG GGACGTGTTC TCCCATCCGC CGAGTCTGCC GAAATACGCC
GGCGTCGGCG TGCCGGAGCT GTGGTGGGCG ACCGCCCCTG CGGCACCTGC CGGCCAAGGG
AAATAG
 
Protein sequence
MAQLNRRNVL ALGVGALAAT HLRGTAAAAE GETIAHGMSA FGDLKYPADF AHFDYVDPRT 
PKGGLFSTIP SVRAFNQSFQ TFNSLNAYIL KGDGAQGMGL TFATLMARAG DEPDAMYGFA
ASKVAISADG LAYRFTMRPE ARFHDGSKLT ARDAAFSLNI LKAKGHPIVT QQMRDFIEAV
ATDDATLVVT FKPKRGRDVP LFVAGLPLFS ETYYSKQPFD ESTMDVPLGS GPYKVGRLES
GRYIEFDRVK DWWGAKLPVN VGAYNFDIVR FEFYRDRDVA FEGFTGRSYL FREEFTSRIW
NTRYDFPAIH DGRVKREILP DDTPSGAQGW FINTRRDKFK DPRVREALGC AFDFEWTNKT
IMYGTYARTV SPFQNSDMMA VGAPSPEELA LLEPFRGKVP DEVFGTPFIP PASDGSGQDR
ALLRRGGQLL NEAGFPIKNG KRLTPQGEPF RVEFLLEEPA FQPHHMPFIK NLGTLGIDAT
LRLVDPVQLR ARRDDFDFDL TIERYSFSTV PGDALRNFFS SQAAATKGSN NLAGISDPAI
DAMIDQVIAA DTRTKLVVAA RALDRLIRAG RYWVPQWYSA SHRLAYWDVF SHPPSLPKYA
GVGVPELWWA TAPAAPAGQG K