Gene Rpal_2052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_2052 
Symbol 
ID6409712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp2223995 
End bp2225617 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content65% 
IMG OID642711938 
ProductPepSY-associated TM helix domain protein 
Protein accessionYP_001991050 
Protein GI192290445 
COG category[S] Function unknown 
COG ID[COG3182] Uncharacterized iron-regulated membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCAAGC ACGATCGACA ATCCGCGAGC CGCAGCATCC GGCAGAGCAT GTCGGACCTT 
CACACCTGGA CCGGGCTGCT GCTCGGCTGG GTGCTGTATG CGATGTTCCT CACCGGCACG
GTGTCGTTCT TCAAGGAGGA GCTGTCGCAG TGGATGCGGC CGGAACTGCC GCGCGTGACG
CAGGCGCTGG ACCCGGCCGT GGTGGCGGAG CGCGTCGCCG ATGAGATCGG CCGGATCGCG
CCCAACGCGA CGCAGTGGAG CATCAAGCTC CCCGAGGGGC GCAGCAACAG CGTCTATGCG
TTCTGGCGGC TTCCGATCGC GCAGGATGCG CGCGGCTTCG GAGAAGGGCA CTTCGACGCG
GTGACTGGCC GTCAGGTCGA GGCTCGGGGA ACACTGGGCG GCGATTTCTT CTACCGCTTC
CATTTCCAGT TCTACTACAT GTCGCCGTTC TGGGGGCGGC TGCTTGCCGG TCTGGCGGCG
ATGTTCATGC TGATCGCCAT CGTCGCCGGC GTCATCACCC ACAAGAAGAT CTTCACCGAC
TTCTTCACCT TCCGCTGGGG CAAGGGGCAG CGCTCCTGGC TCGACGCCCA CAACGCGCTG
TCGGTGTTCG GCCTGCCATT TCATGTGATG ATCACCTACA CCGGGCTGGT GACGCTAATG
GCGCTGTACG TGCCATGGGG CGAGCGCGCC GCCATCAAGA CGCCCGCCGA GCGCCAGCAG
CTGATGGCGG AGCTCAGTGC TTTCATTCAG CCCGGCAAGC CCGCGGCCGA AGCGGCGCCG
CTCGGGTCGA TCGAAACCAT GGTGCGGCAG GCTCAGGTTC GATGGGGTAC GCCCGATGTC
GGGCGCGTCA ACGCGGCCAA TCCGGGCAAC GCGGCTGCCC GTATAGCGGT GACCCGTGGC
GATGCCGGGC GTGTATCAAT GAGTCCGGAT TACCTGGAGT TCGACGGCGT CACCGGAAAA
CTGCTCACCG TGCATGATCA TGTCGGTGCT GCGGCCGAAA CCCGCGGCGT GCTCTACGCG
CTGCACATCG GGCGGTTCAG CGACCTCGAA ACCCGGTGGC TTTACTTCAT CGTCAGCTTC
ATGGGCACCG CGATGGTCGG TACCGGTCTG GTGATGTGGA CAGTGAAGCG ACGGCAGAAG
CTGCCTGATC CAGAGCGGCC GTATTTCGGA TTCCGTCTGG TCGAGCGGCT CAACATTGCC
AGCATCGCCG GGCTGTCGAT CGCCATGACG GCGTTCCTGT GGGCCAACCG TCTGTTGCCG
ACCGCGATGG CGGAGCGGGC GTTCTGGGAA ATCCATGTGT TCTTCATCGT CTGGGGGCTG
ACCTTGCTCC ACGCACTGCT GCGGCCGGCG CGAGTGGCCT GGGTCGAGCA GCTATGGACG
GCCGCTGCGT TGTTAGCGTT GATCCCGGTG CTCAACGCGA TGACGACGCT GCGTCCGCTG
TGGCACAGCT TCGCTATCGG GGATTGGGTG TTCGTCGGCA CGGATCTGAT GTGCTGGACG
CTGGCGCTGC TGCATGCCGT GCTGGCGATC CGCACCGCGC GTCACGGCGC GCGGGTTCGC
CCGCCGCGCG GCTCGGCGAC ACGCCACGCG CTCCCAACGA TGTCGAGCGA GGCGGCAACA
TGA
 
Protein sequence
MTKHDRQSAS RSIRQSMSDL HTWTGLLLGW VLYAMFLTGT VSFFKEELSQ WMRPELPRVT 
QALDPAVVAE RVADEIGRIA PNATQWSIKL PEGRSNSVYA FWRLPIAQDA RGFGEGHFDA
VTGRQVEARG TLGGDFFYRF HFQFYYMSPF WGRLLAGLAA MFMLIAIVAG VITHKKIFTD
FFTFRWGKGQ RSWLDAHNAL SVFGLPFHVM ITYTGLVTLM ALYVPWGERA AIKTPAERQQ
LMAELSAFIQ PGKPAAEAAP LGSIETMVRQ AQVRWGTPDV GRVNAANPGN AAARIAVTRG
DAGRVSMSPD YLEFDGVTGK LLTVHDHVGA AAETRGVLYA LHIGRFSDLE TRWLYFIVSF
MGTAMVGTGL VMWTVKRRQK LPDPERPYFG FRLVERLNIA SIAGLSIAMT AFLWANRLLP
TAMAERAFWE IHVFFIVWGL TLLHALLRPA RVAWVEQLWT AAALLALIPV LNAMTTLRPL
WHSFAIGDWV FVGTDLMCWT LALLHAVLAI RTARHGARVR PPRGSATRHA LPTMSSEAAT