Gene Rpal_0090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_0090 
Symbol 
ID6407733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp100114 
End bp101802 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content69% 
IMG OID642709999 
Productprotein of unknown function DUF894 DitE 
Protein accessionYP_001989128 
Protein GI192288523 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.735299 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGGCA CGGCGAAGCG CGGGCTGTTT TCCGGCGACG GGATCGCGGC GCCGCTGCGG 
CACGCACTAT TCCGGCGGAT CTGGCTGGCA AGCCTGCTGT CCAACCTCGG CCTGATGATC
AACGGCGTCG GCGCCGCCTG GGCGATGACG CAGATGACCG CGTCCGCCGA CAAGGTGGCG
CTGGTGCAGA CCGCCCTGAT GCTGCCGATC ATGCTGGTGG CGATGCCGGC GGGCGCGATC
GCCGACATGT ACGACCGCCG CCTGGTGGCG CTGGCCGCGC TCGGCATCGG CCTCGCCGGC
GCGACGACGC TGGCGGCGCT GGCGCATCTC GGGCTGGTGA CACCCAACAC CCTGCTGCTG
TTCTGCTTCG TGATCGGCAC CGGCATGGCG CTGTTCGGAC CGTCCTGGCA GGCCTCGGTG
TCCGAGCAAG TGCCGGCCGA AACCCTGCCG GCCGCAGTGG CGCTGAACGG CATCAGCTAC
AACATCGCGC GCAGCTTCGG CCCGGCGGTC GGCGGCATCG TGGTGGCGGC TGCCGGCGCG
GTGGCGGCGT TCGCCGCCAA TGCGGTGCTG TATCTGCCGC TATTGATCGT GCTGCTGCTG
TGGCGGCGGG ACAGCGAGCC ACCGCGGCTA CCGCCAGAGC GGCTGAACCG CGCGATCGTC
TCCGGCGTGC GCTATATCAC CAACTCGCCG GCAATCCGCA TTGTGCTAAC CCGCACGCTG
GTGACCGGCA TCGCTGGCTC TTCGGTGCTG GCCCTGATGC CTCTGGTGGC ACGCGACCTG
TTGCACAGCG GCGCCGAGAC CTACGGGCTG CTGCTCGGCG CATTCGGCAT CGGCGCGGTG
ATCGGCGCAC TCAATGTCGG GATTGCGCGG CAGCGCTTGA GCAGTGAAGC CGCGGTTCGG
CTGTGTGCGA TGATCATGGG CGTGGCAATG GCGGTGATCG CGATCAGCCG CTCGCCACTC
CTCACCGCAG CAGCCCTCGT CGTCGCCGGC GCGGTGTGGA TGCTGGCGAT CGCGCTGTTC
AACATCGGCG TGCAACTGTC GGCGCCGCGC TGGGTGGCGG GACGTTCGCT TGCGGCATTC
CAGGCGTCGA TCTCCGGCGG CATCGCGATC GGCAGCTGGG GCTGGGGCCA CGTTGCTGAT
CTGTCCGGCG TCGCGCCATC GATGCTGCTG TCGGGGCTGG CGATGCTGGC TTCTCCAGTG
CTCGCCTTCC TGCTGCCGAT GCCGCCGGTC GGCACCCGCA CCGAGGACGC CGAACTGCTG
GCCGATCCGG AATTGAAACT GGCGCTGACG TCGCGCAGCG GTCCGGTGGT GATCGAAATC
GAGTACCGGA TCGACGCCGA CGAAGCGCGC GCGTTTCACA ACGTGATGCA GGAGGTGCAG
CTCAGTCGCC AGCGCAACGG CGCCTATGGC TGGTCGATCG CCCGCGACGT CGCCGATCCC
GAATTATGGA CCGAGCGCTA TCACTGCCCG ACCTGGCTCG ATTATCTGCG CCAGCGCAGC
CGTTCGACCC AGGACGACCG CGCATTGCAC CGGCGCGCGA TCGCGTTTCA TCGTGGACCG
GAGCCGGTGC GGGTGCGCCG CATGCTGGAG CGGCCGTTCG GCTCGGTGCG CTGGAAAGAG
GAATCGCCCG ATCGCACTAC CGCGACCGAA GTGCTGCCGG TCGCCGGCGT CAGCGGCGGT
TCGACATAG
 
Protein sequence
MAGTAKRGLF SGDGIAAPLR HALFRRIWLA SLLSNLGLMI NGVGAAWAMT QMTASADKVA 
LVQTALMLPI MLVAMPAGAI ADMYDRRLVA LAALGIGLAG ATTLAALAHL GLVTPNTLLL
FCFVIGTGMA LFGPSWQASV SEQVPAETLP AAVALNGISY NIARSFGPAV GGIVVAAAGA
VAAFAANAVL YLPLLIVLLL WRRDSEPPRL PPERLNRAIV SGVRYITNSP AIRIVLTRTL
VTGIAGSSVL ALMPLVARDL LHSGAETYGL LLGAFGIGAV IGALNVGIAR QRLSSEAAVR
LCAMIMGVAM AVIAISRSPL LTAAALVVAG AVWMLAIALF NIGVQLSAPR WVAGRSLAAF
QASISGGIAI GSWGWGHVAD LSGVAPSMLL SGLAMLASPV LAFLLPMPPV GTRTEDAELL
ADPELKLALT SRSGPVVIEI EYRIDADEAR AFHNVMQEVQ LSRQRNGAYG WSIARDVADP
ELWTERYHCP TWLDYLRQRS RSTQDDRALH RRAIAFHRGP EPVRVRRMLE RPFGSVRWKE
ESPDRTTATE VLPVAGVSGG ST