Gene Rpal_4894 details


Gene Information

Locus tag: Rpal_4894
Symbol:
ID: 6412580
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 5261255
End bp: 5262334
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 67%
IMG OID: 642714771
Product: ABC transporter related
Protein accession: YP_001993858
Protein GI: 192293253
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3839] ABC-type sugar transport systems, ATPase components
TIGRFAM ID:
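
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon; neither assumption is stated in the record, and the variable names are chosen here for illustration only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5261255
end_bp = 5262334

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints "1080 359"
```

Under these assumptions the result agrees with the reported gene length (1080 bp) and protein length (359 aa).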


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.332035
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGTCA GCCTGGAGAA TGTCACCAGG GTGCTGGACG GCGTTCCGGC GATCCGCAAC 
GTATCGCTGA CGCTGGAACG CGGTACGCTC AGCGTGCTGC TCGGACCAAC CCTGTCGGGC
AAGACCTCGA TCATGCGGCT GCTCGCCGGC CTCGACAAGC CGACCACCGG TCGTGTGCTG
GTCGACGGCA AGGACGTCAC CGGCGCCGAT GTACGCACCC GCTCGGTGGC GATGGTGTAT
CAGCAGTTCA TCAACTACCC GTCGCTGACC GTGTACGAGA ACATCGCCTC GCCGCTGCGC
GTGCAGCGCA AGTCGCGCGC CGAGATCGAG CAGCGCGTCC AGGAGGCAGC CAAGCTGCTC
AAGCTCGAGC CGTATCTGCA GCGCACGCCG CTGCAGCTTT CCGGCGGCCA GCAGCAGCGC
ACCGCGATCG CCCGCGCGCT GGTCAAGGGT GCCGATCTGG TGTTGCTCGA CGAGCCGCTC
GCCAACCTCG ACTACAAACT GCGCGAGGAA CTGCGCACCG AGCTGCCGAA GATCTTCGAG
GCCTCCGGCG CGATCTTCGT TTACGCCACC ACCGAGCCTT CCGAAGCGCT GCTGCTCGGC
GGCCGCACCA TCTGCATGTG GGAAGGTCAG GCGCTACAGG TCGGGCCGAC CCCGCAGGTG
TATCGCAAGC CCGACACCAT GCGGGTGGCG CAGGTGTTCT CCGATCCGCC GCTCAACATC
GTCGGCGCCG AGAAGAAGGC CGGCACCGTG CATTACTCCG GCGGCGTCAC CGCGCCCGCG
ACTGGTGTAT TCGCGAGCCT TTCCGATGGC GCCTATCGCG TCGGCTTCCG TGCGCATCAG
ATCGAGGTGA AGAGCGCCGA TCCGGATCGC CACGCGTTCC GAGCCACCGT CGCGGTGACC
GAGATCACCG GCTCTGAGAG CTTCGTGCAT CTGAAGCGCG GCGACGATTA CTGGGTCGCG
GTGCTGCACG GCATCCACGA GTTCGAGCCG GGCCAGACGC TCGACGCCAT CCTCGACCCC
GCCAATCTGT TCGTGTTCGA CGCGGCTGAT CGCCTCGTCG CCGCGCCGAA GCCGATCTGA
 
Protein sequence
MSVSLENVTR VLDGVPAIRN VSLTLERGTL SVLLGPTLSG KTSIMRLLAG LDKPTTGRVL 
VDGKDVTGAD VRTRSVAMVY QQFINYPSLT VYENIASPLR VQRKSRAEIE QRVQEAAKLL
KLEPYLQRTP LQLSGGQQQR TAIARALVKG ADLVLLDEPL ANLDYKLREE LRTELPKIFE
ASGAIFVYAT TEPSEALLLG GRTICMWEGQ ALQVGPTPQV YRKPDTMRVA QVFSDPPLNI
VGAEKKAGTV HYSGGVTAPA TGVFASLSDG AYRVGFRAHQ IEVKSADPDR HAFRATVAVT
EITGSESFVH LKRGDDYWVA VLHGIHEFEP GQTLDAILDP ANLFVFDAAD RLVAAPKPI
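
The GC content reported under Gene Information can be approximated directly from the gene sequence. The record does not say how the database computes or rounds this value, so the following is only a minimal sketch: `gc_content` is a hypothetical helper that ignores the whitespace used to group the bases in blocks of ten and returns the percentage of G and C bases.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # Remove the spaces and newlines used to group bases in blocks of ten.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence shown above (60 bases).
first_row = "ATGAGCGTCA GCCTGGAGAA TGTCACCAGG GTGCTGGACG GCGTTCCGGC GATCCGCAAC"
print(f"{gc_content(first_row):.1f}% GC in the first 60 bases")
```

Passing the full gene sequence to the function in place of `first_row` gives a value that can be compared with the 67% listed in the record.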