Gene Rpal_4066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4066 
Symbol 
ID6411750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4365448 
End bp4368693 
Gene Length3246 bp 
Protein Length1081 aa 
Translation table11 
GC content66% 
IMG OID642713948 
Productouter membrane autotransporter barrel domain protein 
Protein accessionYP_001993037 
Protein GI192292432 
COG category[S] Function unknown 
COG ID[COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain
[TIGR02601] autotransporter-associated beta strand repeat 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.392825 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCAGGTTA GGGGCATTGG TGAGCTGAGG TCGAGGGCAA CGGATCGGGC GGGGAGAGCG 
CGAAGACGGA TCTTGCTTTC ATCGCTGTTG GCTTCGACGG CGCTGGTCGC AATCACGATG
CCCGCTTCTG CGCAGCAGGT CTGGGTCGGC ACCGGGCAGG ATTACAATAC GGCATCCAAT
TGGAGCGGTC CCGCCGCGGT GCCCGACACC GGTTCGACTG CAGTGTTCAC CAACAACGGT
GCGTCGACAT CGGTGGTGCT CTCGGTCACG CGCAGCCCGG ATGGCTTCAC GTTTGATACA
GGCGCGCCGA GCTACACGAT CGGAGTCGCA TCCGGCGGGC AGCTCAACAT GAGCGGCGCC
GGCATCGTCA ACAACTCCAG CAATGCCCAG AACTTCCTCA TCGGTCCCGG CAGCCAGATT
GATTTTCTGG GCTCCAGCAC CGCCGGCAAT GCGACCATCA CAACTCTGAG CGGGGCCACG
TTGCTGTTCA GCCGATCCTC GTCGGGAGGA ACGGCTTCAA TTGCCAACGA TGGGACGATG
GTCCTGCGGA CAGATTCGGG CGCGATCTCG ATCGGGTCGC TGTCGGGCAC CGGGACCGTC
GCGGCGACGA CGGTTAGCGG GGTTCCGGTC CAGACTCTGA CCGTCGGCAG CCTGAACACC
TCCACGGAAT TCTCAGGCAC GTTCGTCGAC AACGGAGCAC AATTCGCGCT CGGCAAGACC
GGCACCGGCA CGCTGACGCT CACCGGCGAC AATTTCTACA CCGGCGGCAC CACGATCTCG
AGCGGAACGC TGCAGCTCGG CAATGGCGGG ATCAGCGGCT CGATCACCGG CGACATCACC
AACAATGCGA CGCTGACGGT CAACCGCAGC AACGGCACCA GCCTCGGCGG CGTGATCTCC
GGCAGCGGCC AATTAGTCAA GCTTGGCGGC GGCATTCTAG CGCTGCTCGG CAACAACACT
TACACCGGCG GCACGACGAT CTCGGCGGGC ACGCTGCGGG TCGGCAACGG CGCCACCAGC
GGTTCCATCG TGGGCGACGT CGTCAACAAC GGCGTACTGC AGTTCAATCG GTTCGACTCG
ATCGGCTTCA ATGGCGTGAT CACCGGCACC GGCAGCGTCA CCAAGCTCGG CAATAACGCG
ATGATCCTGG GCGGGGACAA CACCTATACG GGCGGCACGA CAATCAGCGG CGGTTACTTG
CAGGTCGGCA ATGGGGGCAC CGGCGGCTCG ATCGTCGGCG ACGTCCTCAA CAACGGGACG
CTGGAATTCG CGCGTTCCGA CGCCCATACG TTCAGTGGCG CTATTTCCGG CACCGGCAAT
CTGATCAGCT TCGGCGGCAG CGCCGGCAGT GGCGTTTTCA CGATGACCGG AACCAATACT
TACACCGGGG GCACCACCGT CTCCAGAGGC ACATTGCAGA TCGGCGACGG CGGCACCTCG
GGCTCGATCG TCGGCGACGT CACCAACAAC GCCACGCTCG CCTTCAATCG CTCCGACGCG
ACCAGCTTCG GCGGCGCGAT CTCAGGCGGC GGCAATCTGA TCAAGCGCGG CGCCGGCAAC
CTGTCGCTGA CCGGCGTCAG CAGCTACACC GGCGCCACCA CGGTTGAAGC CGGCACGCTC
AGCGTCAACG GCTCGATCGC GTCCTCGTCG CTGACGACGG TGAACGCCGG CGCAGCGCTC
GGCGGCAACG GCACGGTTGG CACCACGCTG ATCAACGGCG GCGCGCTGGC ACCCGGCAAT
TCGATCGGCA CGCTGAATGT GAGCGGCAAC CTGACCCTCA CGGCTGCGTC GAGCTACATG
CTCGAGCTGT CGCCGAGCAG CGCCGACCGC GTCAACGTCA GCGGCACCGC CACGCTCGGC
GGCGCCACGG TGAAAGCGTC GTTCGCCAGC GGCGGCTATG TCGAGCGGCA ATACACTCTC
GTCAATGCGA CCGGCGGCGT GGTCGGCACC TTCGGCACGC TGGTGAATAC CAATCTGCCG
TCCGGCTTCA GATCGAACCT CGGCTACGAT TCCAACAATG CCTATCTCAA TCTGGTGCTC
GACTACACGC CCGGTCCGTC GCCGGACATC AACAGCGGCC TGAACCGCAA CCAGACGGAG
GTCGCCAATG CGCTGAGCGG TTACTTCGCG CGCACCGGCA GCATTCCGAT CGTGTTCGGC
GCGCTGAACC CGAGTGGGCT CAGCGCCGTG TCGGGCGAGA CCGCGACCGG CGCGCAGCAG
TCGACGTTCA GTGCCATGAC CCAGTTCCTG GGCGTGCTGA CCGATCCGTC GAGCAACGGC
CGCGGTGCGC GGGATGCTGC GCCGGGGCCG TTGGGGTTCG CGGATCGCAC GCCCCGCGGC
TCGGCGTCCG ACGCCTATGC GATGATCACC AAGAGCGCTG CCGAGCGGTT CGTTCCGCAT
TGGAATGTGT GGGGCGCGGG CTTCGGCGGC TCACAGACCA CCGATGGCAA TGCTTCGCTC
GGCTCCGCCA CCGCAACCAG CCGGCTCGCC GGCATTGCTG CAGGTGCCGA CTACTGGCTG
TCGCCGCAGA CTGTCGCGGG TTTCGCGATG GCCGGCGGCG CCACGCAATT CGGACTAGCG
GGCGGCCTCG GCTCGGGCAC GTCGGATCTG CTCCAGGTTG GCGGCTTCAT CCGCCACAGT
TTTGGTGCGA GCTACCTGAC CGCAGCGGCG GCCTATGGCT GGCAGGACAT CACCACCGAA
CGCACCGTCG CGATCGGCGG CCTCAATCAG CTCCGCGCCA ACTTCAACGC CAACGCTTAC
TCCGCGCGGG TCGAGGCCGG GCATCGCTGG ATCGCCCCGG CGATCGGCGG TGTTGGTCTG
TCACCGTACG CTGCCGCGCA AGTGACGGCC TTTGATCTGC CGGCCTATGC CGAGCAGGCT
GTGGGCGGAA CCGGCGTGTT CGCGCTCGGC TATGCGGCCA AGACCGTGAC CGCGACGCGC
AGCGAGCTCG GCGTGCGGAC CGACAAGTCG TTCGCGCTGG ATGGCGCGCT GCTGACGCTG
CGCGGCCGCG CCGCCTGGGC GCACGACTTC GATGTCGACC GGTCGGTGGC GGCGACCTTC
CAGGCGCTGC CCGGCGCCAG CTTCGTTGTG AACGGCGCGC GACCGGCGCG CGATGCGGCG
CTGACCACGG TGTCGGCGGA AGTGAGCTGG CTGAACGGCT TCTCGGTCGC CGCCAGCTTC
GAAGGCGAGT TCTCCGACGT GACCCGCAGC TATGCCGGCA AGGGACTGCT GCGCTACGCG
TGGTGA
 
Protein sequence
MQVRGIGELR SRATDRAGRA RRRILLSSLL ASTALVAITM PASAQQVWVG TGQDYNTASN 
WSGPAAVPDT GSTAVFTNNG ASTSVVLSVT RSPDGFTFDT GAPSYTIGVA SGGQLNMSGA
GIVNNSSNAQ NFLIGPGSQI DFLGSSTAGN ATITTLSGAT LLFSRSSSGG TASIANDGTM
VLRTDSGAIS IGSLSGTGTV AATTVSGVPV QTLTVGSLNT STEFSGTFVD NGAQFALGKT
GTGTLTLTGD NFYTGGTTIS SGTLQLGNGG ISGSITGDIT NNATLTVNRS NGTSLGGVIS
GSGQLVKLGG GILALLGNNT YTGGTTISAG TLRVGNGATS GSIVGDVVNN GVLQFNRFDS
IGFNGVITGT GSVTKLGNNA MILGGDNTYT GGTTISGGYL QVGNGGTGGS IVGDVLNNGT
LEFARSDAHT FSGAISGTGN LISFGGSAGS GVFTMTGTNT YTGGTTVSRG TLQIGDGGTS
GSIVGDVTNN ATLAFNRSDA TSFGGAISGG GNLIKRGAGN LSLTGVSSYT GATTVEAGTL
SVNGSIASSS LTTVNAGAAL GGNGTVGTTL INGGALAPGN SIGTLNVSGN LTLTAASSYM
LELSPSSADR VNVSGTATLG GATVKASFAS GGYVERQYTL VNATGGVVGT FGTLVNTNLP
SGFRSNLGYD SNNAYLNLVL DYTPGPSPDI NSGLNRNQTE VANALSGYFA RTGSIPIVFG
ALNPSGLSAV SGETATGAQQ STFSAMTQFL GVLTDPSSNG RGARDAAPGP LGFADRTPRG
SASDAYAMIT KSAAERFVPH WNVWGAGFGG SQTTDGNASL GSATATSRLA GIAAGADYWL
SPQTVAGFAM AGGATQFGLA GGLGSGTSDL LQVGGFIRHS FGASYLTAAA AYGWQDITTE
RTVAIGGLNQ LRANFNANAY SARVEAGHRW IAPAIGGVGL SPYAAAQVTA FDLPAYAEQA
VGGTGVFALG YAAKTVTATR SELGVRTDKS FALDGALLTL RGRAAWAHDF DVDRSVAATF
QALPGASFVV NGARPARDAA LTTVSAEVSW LNGFSVAASF EGEFSDVTRS YAGKGLLRYA
W