Gene Rpal_4009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4009 
Symbol 
ID6411691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4295934 
End bp4297448 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content66% 
IMG OID642713891 
Productprotein of unknown function DUF112 transmembrane 
Protein accessionYP_001992980 
Protein GI192292375 
COG category[S] Function unknown 
COG ID[COG3333] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCTGT TCTCCAATCT CGCGCTCGGC TTCCAGGTCG CCGCTTCGCC GACAAACCTG 
CTGCTGTGCC TCACCGGCGC GCTCGTCGGC ACCCTGATCG GCGTACTGCC GGGCATCGGC
ACCATCGCCA CCGTGGCGAT GCTGCTGCCG ATCACTTTTG GCCTGCCGCC GGTCGGTGCG
CTGATCATGC TCGCCGGCAT CTATTACGGC GCCCAATACG GCGGTTCGAC CACCTCGATC
CTGGTCAACA TTCCGGGCGA GGCGACCTCG GTGGTGACCA CGCTCGACGG CTTTCAGATG
GCCAAGCAGG GCCGCGCCGG CCCGGCGCTG GCGATCGCCG CGATCGGCTC GTTCGCCGCC
GGCTGCTTCG CCACCGTGCT GATCGCGCTG GTCGGCGAGC CCTTGACCCG GCTGGCGCTG
GAGTTCGGTC CGGCCGAGTA CTTCTCGCTG ATGGTGCTGG GTCTGGTGTT CGCCGTGGTG
CTGGCGCGCG GGTCGGTGCT GAAGGCGGTG GCGATGATCG TGCTCGGGCT GCTGCTGTCG
ACCGTCGGCT CCGACATCGA AACCGGCGTC TCGCGCATGA CCTTCGATGT CCCGGAACTG
GCGGACGGGC TCGGCTTCGC CACGGTGGCG ATGGGCGTGT TCGGTTTCGC CGAGATCATC
CGCAACCTGG ATTTCGGCGC CGCGACCGAC CGCGAGCTGG TGCAACAGAA GATCACCGGC
TTGATGCCGA CCCGGAAGGA TCTGCGCGAC GCGGCGCCGG CGATCGGCCG CGGCACCATC
CTCGGCTCCC TCCTCGGTAT CCTGCCCGGC GGCGGGGCGG TGATCGCCTC GTTCGCGGCC
TACACGCTGG AGAAGAAGAT CGCGCGCGAC CCGAAACGGT TTGGCCGCGG CGCGATCGAA
GGCGTCGCGG CGCCGGAAAG CGCCAATAAC GCCGCCGCCC AGACCTCGTT CATCCCGCTG
CTGACGCTCG GGATCCCGCC GAACGCCGTG ATGGCCCTGA TGGTCGGCGC GATGACCATT
CACAACATTG TACCGGGGCC GCAGGTGATG AAGAACCAGC CTGAACTGGT CTGGGGCATG
ATCGCCTCGA TGTGGATCGG CAACCTGATG CTGCTGGTGA TCAATTTGCC GCTTGTGGGT
ATTTGGGTAC GATTATTGCG TGTTCCGTAC CGCTTGATGT TTCCGTCGAT CGTGGTGTTC
TGCTGTATCG GGATCTACTC GGTGAACAAC GCGCCGGTGG ACGTGGTCCT GGCCGGCGCG
TTCGGGCTGA TCGGTTACTG GCTGGTGAAG CACGATTTCG AGCCGGCGCC GCTATTGCTC
GGCATGGTGC TGGGGCCGCT GATGGAGGAC AATTTGCGGC GTGCACTGCT GATTTCGCGT
GGTGATGCCT CGGTATTCAT CACCCGGCCG CTGTCGGCCT CGCTGCTGGT CATCGCTGCC
GGCCTGCTGA TCCTGTCGGT ATTACCGATG CTGCGGCGCA AGCGTGACGA AGTGTTCGTC
GAGTCCGAGG GGTAA
 
Protein sequence
MDLFSNLALG FQVAASPTNL LLCLTGALVG TLIGVLPGIG TIATVAMLLP ITFGLPPVGA 
LIMLAGIYYG AQYGGSTTSI LVNIPGEATS VVTTLDGFQM AKQGRAGPAL AIAAIGSFAA
GCFATVLIAL VGEPLTRLAL EFGPAEYFSL MVLGLVFAVV LARGSVLKAV AMIVLGLLLS
TVGSDIETGV SRMTFDVPEL ADGLGFATVA MGVFGFAEII RNLDFGAATD RELVQQKITG
LMPTRKDLRD AAPAIGRGTI LGSLLGILPG GGAVIASFAA YTLEKKIARD PKRFGRGAIE
GVAAPESANN AAAQTSFIPL LTLGIPPNAV MALMVGAMTI HNIVPGPQVM KNQPELVWGM
IASMWIGNLM LLVINLPLVG IWVRLLRVPY RLMFPSIVVF CCIGIYSVNN APVDVVLAGA
FGLIGYWLVK HDFEPAPLLL GMVLGPLMED NLRRALLISR GDASVFITRP LSASLLVIAA
GLLILSVLPM LRRKRDEVFV ESEG