Gene Rpal_4370 details

Gene Information

Locus tag: Rpal_4370
Symbol:
ID: 6412054
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4696164
End bp: 4698122
Gene Length: 1959 bp
Protein Length: 652 aa
Translation table: 11
GC content: 66%
IMG OID: 642714252
Product: protein of unknown function UPF0118
Protein accession: YP_001993341
Protein GI: 192292736
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
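
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4696164, 4698122)
print(length)                     # 1959
print(protein_length_aa(length))  # 652
```

Both results match the Gene Length and Protein Length fields listed in the record.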


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCTGG GCAAAGCCCA ATCCCTGGAA GACGTCGCCG GTCTGGTCGG CGCCTGCGCG 
GTCACGATCC TGGCCGTCAT CATCATCTCG GCGCTGTATG TCGGCCGAGA GGTGTTCGTC
CCGGTCGCCC TGGCGATCCT GCTGAGCTTC GTGCTGGCTC GCCCGGTCAA CTTCCTGCAG
TCACTGCGGG TGCCGCGGGC AATCGCGGCG ATCACCACCG TCCTTTTCGC CTTCGCGGTG
ATCTTCGCGC TCGGCAGCCT GATCGCGACG CAGCTGTCGC GGCTGGCCGA CGACCTGCCG
CAGTACCAAT CGACGATCCA ATCCAAGATC ACCTCGCTGC GCGGCGTGAC CGGCGGCTCC
ACGACGCTCG AACGCGCCGA GGGGATGCTG CAGAATCTCA GCAAGGAACT GAACAAACCG
AAGAACGCGC CCGCGCCGTC GCTGAGCAAT CCGCCGACGA CGTCGTCGAG ACCGGTCACC
CCCGTTCCGG TCGAAGTCCT GCAGCCCGAC CCGGGGACCT TGGCGAACCT GCGATCTCTG
ATCGCGCCGC TGATCTCGCC GCTGGCGACG ACCGGCATCA TCGTGATCTT CGTCATCTTC
ATCCTGTTGC AGCGGGAGGA CCTGCGCAAT CGGCTGATCC GGCTCGCCGG CACCCGCGAT
CTGCAGCGCA CGACCGCGGC GCTGGATGAT GCCGCCAGCC GGCTGAGCCG CTTGTTCCTC
AATCAGCTGC TGATCAACTC CGGCTTCGGC GTGCTGATCG GCACCGGGTT GTGGATCATC
GGCGTGCCGA GCCCGGCGCT GTGGGGCATT CTCGCCGCGG TGCTGCGCTT CGTGCCGTAT
ATCGGATCGA TCATCTCAGC GGCCTTCCCA CTGACCCTGG CGGTCGCGGT CGATCCCGGC
TGGTCGATGC TGGTGTGGAC GGCGATCCTG TTCTTCGTGA TCGAACCGGC GATCGCCCAT
GTCGTCGAGC CGATGGTGTA CGGCCGTAGT ACCGGGCTGT CGCCGGTCGC CGTGGTGATC
TCGGCGACGT TCTGGACGGC GCTGTGGGGC CCGATCGGCC TCGTTCTCGC CACGCCGCTG
ACGGTGTGTC TCGTCGTGCT CGGGCGGCAC GTCGAGCGGT TGGCGTTTCT CGACGTAATG
TTCGGTGATC GGCCGGCGCT ATCGCCGCCG GAGATCTTCT ATCAGCGCAT GCTGGCCGGC
GACCCGGCCG AAGCCGCCGA GAAGGCCGAG CAATTTCTCA AAGAACGGTC GCTGTCGTCG
TATTACGACG ACGTCGCCCT GAAAGGCCTG CAACTAGCCC AGGCCGACCT CGATCGCGAC
GCACTCGACG CCGTGCGCCT GACGCGGATC AAGGAGACGG TGCAGGAGTT CACCGAGGAC
CTCACGGACG AAATCGATCA GGCGCCGGAC GGCGACGAAG CCACCACCGA CGCCGAGGCT
GCTGCCGCCG TCGAAGTGAC GCCGGTCGAT CACGCCGACG ACGACATCGC AGTGCTGAAG
CCCGCCGACC TCAAGCCTGG ATGGCAAGGC GCCGCACCGG TGATGTGCAT CGGCGGACGG
TCGCAATTGG ACGAAGCCGC GGCGCTGATG CTGGCGCATT TGTGCCGCGT GCACGGCATC
GGCGCCCGTG TCGAGCCATC GAGCGCGCTG TCCACCAAGA ACATCTTTGG CCTCGACGTC
TCGAACGTCG CGCTGATCTG CCTGTCGTAT CTCGAGGCGT CGAACACGAC CCATATCCGC
TACGCCGTCC GCCGCCTGCG TCGCAAGGCG CCGCACGCCA AGATCATCGT CGCTTTGTGG
TCGGCGGAGA CTCCGCAACT GGCCGATACC AACGAATCCG CGCAGGCCGA CGCGACGGTG
CTGACGCTGC GGGACGCCGT GAAATACTGC GTCGAAGAGG CGATCATTGA GCCGCCGCCA
CAGACGATCG AAATGCCGGT GATCAGCGAG GCTGTGTAG
 
Protein sequence
MKLGKAQSLE DVAGLVGACA VTILAVIIIS ALYVGREVFV PVALAILLSF VLARPVNFLQ 
SLRVPRAIAA ITTVLFAFAV IFALGSLIAT QLSRLADDLP QYQSTIQSKI TSLRGVTGGS
TTLERAEGML QNLSKELNKP KNAPAPSLSN PPTTSSRPVT PVPVEVLQPD PGTLANLRSL
IAPLISPLAT TGIIVIFVIF ILLQREDLRN RLIRLAGTRD LQRTTAALDD AASRLSRLFL
NQLLINSGFG VLIGTGLWII GVPSPALWGI LAAVLRFVPY IGSIISAAFP LTLAVAVDPG
WSMLVWTAIL FFVIEPAIAH VVEPMVYGRS TGLSPVAVVI SATFWTALWG PIGLVLATPL
TVCLVVLGRH VERLAFLDVM FGDRPALSPP EIFYQRMLAG DPAEAAEKAE QFLKERSLSS
YYDDVALKGL QLAQADLDRD ALDAVRLTRI KETVQEFTED LTDEIDQAPD GDEATTDAEA
AAAVEVTPVD HADDDIAVLK PADLKPGWQG AAPVMCIGGR SQLDEAAALM LAHLCRVHGI
GARVEPSSAL STKNIFGLDV SNVALICLSY LEASNTTHIR YAVRRLRRKA PHAKIIVALW
SAETPQLADT NESAQADATV LTLRDAVKYC VEEAIIEPPP QTIEMPVISE AV
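
The gene and protein sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that step, with a GC-fraction helper whose result for the full gene sequence can be compared against the GC content listed in the record. The helper names are hypothetical and not part of any database tooling.

```python
def normalize_sequence(block_text: str) -> str:
    """Join a blocked, multi-line sequence into one uppercase string."""
    return "".join(block_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C in an uppercase DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAAGCTGG GCAAAGCCCA ATCCCTGGAA GACGTCGCCG GTCTGGTCGG CGCCTGCGCG"
seq = normalize_sequence(first_line)
print(len(seq))  # 60
```

Passing the complete gene sequence through `normalize_sequence` and then `gc_fraction` gives a value between 0 and 1; multiply by 100 to compare it with the percentage in the record.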