Gene Rpal_4893 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4893 
Symbol 
ID6412579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5260142 
End bp5261242 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content66% 
IMG OID642714770 
ProductABC transporter related 
Protein accessionYP_001993857 
Protein GI192293252 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.293169 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCGCA TCGATCTCGT CGATCTTGCT CACTCCTATC TGGTCGGTGA CGACCTGCCG 
CCGCAGGCCT ATGCGCTGAA GCCGGTGTCG ATGACCTGGC GGCAGGGCGG TGCCTATGCG
CTGCTCGGCC CCTCGGGCTG TGGCAAGACC ACGCTGCTCA ACATCATCTC GGGGATCGTC
ACGCCGTCGC GCGGCAAGAT CCTGTTCGAC GGCCAGGACG TCACCGAGCT CTCCACCCGC
GACCGCAACA TCGCGCAGGT GTTCCAGTTT CCGGTGATCT ACGACACCAT GACGGTGCGC
GAGAATCTGG CGTTTCCGCT GAAGAACCGC GGCGTGCCGA AGGCGGACAT CGCCAAGCGC
GTCGCCGAGA TCGCCGATCT TCTCGACCTG ACGCCGAACC TGAACCGCAA GGCGACGCGG
CTCACCGCCG ACGCCAAGCA GAAGATCTCA CTCGGCCGCG GCCTGGTGCG CAACGACGTC
GCCGCGATCC TGTTCGACGA GCCGCTGACG GTGATCGACC CGCATCTGAA GTGGGAGCTG
CGCTCCAAGC TCAAGGCGCT GCACCGCGCG CTGGATCTCA CGATGATCTA CGTCACCCAC
GACCAGACCG AAGCGCTGAC CTTCGCCGAC ACCGTGGTGG TGATGCATGA CGGCCGCGTG
GTGCAGAGCG GCACGCCGGA AGAACTGTTC GAGCGCCCGG CCCACACCTT CGTCGGCTAT
TTCATCGGCT CGCCCGGCAT GAACATTGTG CCGGCCGAGG TGAAGGGCCG CGAGGCGCTG
GTCGCCGGTC ATCGTGTGCC GCTCGCCCGC AGTTACGACA AGCTCCCCTC CGGCAAGATC
GAGATCGGGG TGCGGCCGGA ATTCGTCCAC CTCACCGCCA AGGCGCCGGG CCTGATGACG
GGCCGCGTCG AGCGGATCGA CGATCTCGGC CGCATTCGCT TTGCCTCGGT GCGAGTCGGC
GACGTCAAGT TCGCCGCGCG CGTGCCGGAT GGCTTCTCGC CCAGTGGGGA TGAAGCCGCG
CTGCGGTTCG AGCCGTCCCG CATCCACGTC TATGCCGACA GCGAAATCGT CGAAGGCAGC
TCACTGGAGC AGGTCGCCTG A
 
Protein sequence
MARIDLVDLA HSYLVGDDLP PQAYALKPVS MTWRQGGAYA LLGPSGCGKT TLLNIISGIV 
TPSRGKILFD GQDVTELSTR DRNIAQVFQF PVIYDTMTVR ENLAFPLKNR GVPKADIAKR
VAEIADLLDL TPNLNRKATR LTADAKQKIS LGRGLVRNDV AAILFDEPLT VIDPHLKWEL
RSKLKALHRA LDLTMIYVTH DQTEALTFAD TVVVMHDGRV VQSGTPEELF ERPAHTFVGY
FIGSPGMNIV PAEVKGREAL VAGHRVPLAR SYDKLPSGKI EIGVRPEFVH LTAKAPGLMT
GRVERIDDLG RIRFASVRVG DVKFAARVPD GFSPSGDEAA LRFEPSRIHV YADSEIVEGS
SLEQVA