Gene Rpal_4778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4778 
Symbol 
ID6412464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5142575 
End bp5143504 
Gene Length930 bp 
Protein Length309 aa 
Translation table11 
GC content66% 
IMG OID642714657 
Productprotein of unknown function DUF6 transmembrane 
Protein accessionYP_001993744 
Protein GI192293139 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.289299 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACCA CGACGCCGCC CGCTGAAGCG CACCGCTTCG GCTGGATCTC CGACCAAGCC 
TATCTGCTGC TCAGCCTGAC CTCGCTGTTC TGGGCCGGCA ACGCCATTGT CGGCCGTGCC
ATCGCCGGGC ACTTCCCGCC GGTGACGCTG TCTTTCCTGC GCTGGACCTG CGCGTTCCTG
ATCGTCGCGC CGTTCGCGTG GCGGCACCTG ATCGCCGACT GGAAGGTGAT CCGCCGGCAT
CTGCCGCTGA TGATGTCGGT GTCGATCATC GGCATCTCGA CCTTCAACAC GCTGCAGTAC
ACCGCCCTGC AATACACCTC GGCACTCAAC GTGCTGTTGC TGCAATCGAC CGCGCCGCTG
TTCGTGGCGC TGTGGGCGCT GATCGTGCTG CGGATGCGGC TGACGCTGAC CCAGGCGATC
GGCATCGGCT CGTCGATGAT CGGCGTGGTG GTGATCATCC TGCACGGCAA CATCGCCGAA
CTCGCCGCGA TCGATCTCAA CCGCGGCGAC GTGCTGTTCG TCTGCGCGCT GGCGAGCTTC
GGGCTCTACA CCACGCTGAC GCAGAAGCGG CCGCCGATGC ATGCGCTGTC GTTTCTCGGC
TTCACCTTCG GCTGCGGCGC GCTGTTCCTG ATCCCGCTGG AGATCTGGGA GCTGAGCACC
ACGCCGCTGC CGGCATTCGA CTGGGCCAAT CTCGGTGCGC TCGCCTATGT GGCGGTGTTC
CCGTCGATCC TGGCCTATCT CTGCTACAAC CGCGGCGTCC GGCTGATCGG CGCCAACCGC
TCGGCGCCGT TTTTCCATCT GGTGCCGGTG TTCGGCTCGG CGATGGCGAT CCTGTTTCTC
GGCGAGCAAC CGCACCTCTA CCATGCCGTC GGCTACGCGC TGGTGCTGAC CGGCGTGGTG
ATCGCGGCGC GCAAGCCGAA GACGGCGTGA
 
Protein sequence
MTTTTPPAEA HRFGWISDQA YLLLSLTSLF WAGNAIVGRA IAGHFPPVTL SFLRWTCAFL 
IVAPFAWRHL IADWKVIRRH LPLMMSVSII GISTFNTLQY TALQYTSALN VLLLQSTAPL
FVALWALIVL RMRLTLTQAI GIGSSMIGVV VIILHGNIAE LAAIDLNRGD VLFVCALASF
GLYTTLTQKR PPMHALSFLG FTFGCGALFL IPLEIWELST TPLPAFDWAN LGALAYVAVF
PSILAYLCYN RGVRLIGANR SAPFFHLVPV FGSAMAILFL GEQPHLYHAV GYALVLTGVV
IAARKPKTA