Gene Rpal_5274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_5274 
Symbol 
ID6412975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5688105 
End bp5689433 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content65% 
IMG OID642715164 
Productextracellular solute-binding protein family 1 
Protein accessionYP_001994236 
Protein GI192293631 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGGAA TTTTGGGAAG ACTCGCGCAG ATCGCCGCCG CCGTGCTGGT TTGCGCCACC 
GCTGGTATCG TCCCCGCTGG CGCCGCCACC GAAATTGCCT GGTGGCATGC GATGTCCGGT
GAGCTCGGCC GGCAGCTTGA GAAACTGGCG GCGGATTTCA ATGCGTCGCA ATCCGATTAC
CGCGTGGTCC CGACCTACAA GGGCAATTAC ACCCAGACGG TGACCGCGGC GATCTTCGCG
TTTCGCTCCT CCAGCCAGCC GACGATCGTG CAGGTCAACG AGATCGCCAC CGCCACCATG
ATGGCGGCCA AGGGCGCGGT CTATCCGGTG TACGAGCTGA TGCGCGACGA GAGCGAGGTG
TTCTCGCCGG CCGACTATCT GCCGGCCGTC ACCGGGTACT ACACCGATCT TAGCGGCAAC
ATGCTGTCGT TTCCGTTCAA CGCGTCGACA CCAATTCTGT ACTACAACAA GACGCTGTTT
CGTCGCGCTG GGCTCGATCC GGAGGTGCCG CCGCCGACTT GGCCGGAAGT CGGGACGATG
GCGAAGCGGC TGATCGACGC CGGCGCAGCG TGCGGCTTCA CCACCTCGTG GCCGTCCTGG
GTGCATATCG AGAACTTTTC CGCCTATCAC AACCTGCCGC TGGCGACCCA GTCGAACGGG
CTGGGCGGGC TTGATGCCGA ACTGGTGTTC AACAATCCGG CGGTGGTGCG CCATATCGCG
CAGCTTGCCG ATTGGCAGAA GACCAAGACC TTCGATTACG GCGGCCGCGC CACCGCGGCC
GAACCGCGCT TCCAGCAGGG TGACTGCGGC ATCTTCATCG GCTCATCGGC AACGCGGGCC
GACATCCTGG CCAACGCCAA GTTCGATGTC GGCTACGGCC GGCTGCCGTA TTGGCCGGAC
ATCGCCGGCG CGCCGCAGAA CACCATCATC GGCGGTGCCA CACTATGGGT GCTGCGCGGC
CATTCGGCGG GCGAATACAA AGGCGCCGCC AAGTTCTTCG CCTACCTGTC GAAGCCGGAA
GTTCAGGCGG CCTGGCATCA GCACACCGGC TACCTGCCGA TCACAAAGGC GGCCTACGAT
CTCACCCGCG CCCAGGGCTT CTACGACCGC AATCCCGGCA CCGCGATCTC GATCGAACAG
ATCACGCTGA AGCCGCCGAC CGAGAATTCG CGCGGGCTGC GGCTCGGCTC GTTCGTGCTG
GTGCGGGCGG CGATCGAAGA CGAGATCGAA CACGCGGTGC GGGGCGATAA GCCGGCGAAA
GAGGCGATGG ACGCGGCGGT CGAGCGCGGC AACAAGCTGC TGCGGCAGTT CGAACGCACC
AAGCCGTAA
 
Protein sequence
MAGILGRLAQ IAAAVLVCAT AGIVPAGAAT EIAWWHAMSG ELGRQLEKLA ADFNASQSDY 
RVVPTYKGNY TQTVTAAIFA FRSSSQPTIV QVNEIATATM MAAKGAVYPV YELMRDESEV
FSPADYLPAV TGYYTDLSGN MLSFPFNAST PILYYNKTLF RRAGLDPEVP PPTWPEVGTM
AKRLIDAGAA CGFTTSWPSW VHIENFSAYH NLPLATQSNG LGGLDAELVF NNPAVVRHIA
QLADWQKTKT FDYGGRATAA EPRFQQGDCG IFIGSSATRA DILANAKFDV GYGRLPYWPD
IAGAPQNTII GGATLWVLRG HSAGEYKGAA KFFAYLSKPE VQAAWHQHTG YLPITKAAYD
LTRAQGFYDR NPGTAISIEQ ITLKPPTENS RGLRLGSFVL VRAAIEDEIE HAVRGDKPAK
EAMDAAVERG NKLLRQFERT KP