Gene Rpal_3857 details

Gene Information

Locus tag: Rpal_3857
Symbol:
ID: 6411537
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4143765
End bp: 4146686
Gene Length: 2922 bp
Protein Length: 973 aa
Translation table: 11
GC content: 58%
IMG OID: 642713739
Product: hypothetical protein
Protein accession: YP_001992830
Protein GI: 192292225
COG category: [V] Defense mechanisms
COG ID: [COG1002] Type II restriction enzyme, methylase subunits
TIGRFAM ID:
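
The coordinate and length fields above are tied together by simple arithmetic. The following is a minimal sketch of that check, assuming the Start bp and End bp values are 1-based and inclusive (an assumption that matches the listed numbers but is not stated in the record) and that the final codon is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 4143765
END_BP = 4146686


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 2922 973
```

Both results agree with the Gene Length (2922 bp) and Protein Length (973 aa) fields.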


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.663936
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCCCG CCGAATTCAT CAAGAAATGG AAGCCTGTCG CGCTGACGGA GCGCGCGGCC 
GCACAGACTC ACTTCCTCGA TCTGTGCAAG CTGTTCGAGC ACGAAGATCC GGTGTCGGCC
GACCCGACGG GTGAATGGTT CACCTTCGAG AAAGGCGCAA CCAAAACCGG CGGCGGCGAC
GGCTTTGCGG ACGTCTGGAA GAAGAACTAC TTTGCCTGGG AATATAAGAA GAAGAAGCGC
GACCTCGGCG TCGCAATGAA CCAGCTCGTT CGCTACGCTG CAGCACTGGA AAATCCGCCG
CTGCAGGTCG TTTGCGACAC CGATCGCTTC GTCATCCGCA CCGCCTGGAC CAATACGGTT
CCGAAGGAAT ACGAGATCGA GCTTGACGAT CTTGCCGATC CGGAGAAGCG CAAGATTCTC
TGGGCGGTGT TTCACGACCC CGAGCAGTTG CGGCCGCAGC AGACCCGCAC CGCGATCACT
AAGGAAGCCG CCGACAAATT CTCAACCATC GCGCTACGCC TGCAGGGCCG CGGCACGCCG
GAAGAGATCG CGCACTTTGT CAATCAGTTG GTGTTCTGTT TCTTTGCGAG CAGTGTCAAG
CTGCTGCCGG AAGGCTTCTT TCCGAAGCTA CTGAAGCGCG CAGCGCAAAA GCCACAACAT
GCCATCGACT ACTTCAACAA GCTGTTTGAG GCGATGGAGA ACGGGGGTGA ATACGACCTG
ACTGACATCG CGCATTTCAA CGGCGGGCTA TTCGACGGGC GCCGCGCGCT CAAACTCGAC
GAGGGCGATA TCGGCCTCTT GATCGAGGCC GGCAGTCTCG ATTGGGGCCA GATCGATCCG
ACGATCTTCG GCACGCTGTT CGAGCGTTTC CTCGATCCCG ACAAGCGGGC TCAGATCGGC
GCGCACTACA CCGACCCCGA CAAGATCCTG ATGATCGTTG AGCCGGTGAT TCTGCGGCCG
CTTCGAGCGG AATGGGACGC CGCACGCGCG AAAATTGCTG AGATCGCTGG AGAAGCCAAC
GCGTTGCAAC AAACCGGCTT CAGCAAGCAA GGCGCCAAGA GCTTCGACAA GAAGATCACG
AATATTCGTG CGAAAGCCGA AGTCATCCGG GATCAATTCA TCGAGCGGCT GCGCGGCATC
ACCATCCTCG ATCCAGCTTG CGGCTCGGGC AACTTCCTGT ATCTCGCCTT GCAAGGGGTT
AAGGATATCG AGCTCCGCGC CAACCTCGAA TGCGAAGCGC TCGGACTGTC GCCGCGACTT
CCGGTAATTG GCCCGGAAAT CGTCCACGGC CTCGAGATCA ATGAACTCGC CGCGGAGCTG
GCGCGCACCA CGATCTGGAT CGGCGACATC CAATGGCGCA TCCGCAACGG CATCTACTCC
AACCCGCGTC CGATCCTGCG CAAGCTGGAT TCGATCGAAT GCCGTGACGC GCTGATTACT
AAGCTAACAG ATGGAACTTA CGCAGAGGCC GAATGGCCTA CGGCTGAATT TATCGTGGGC
AATCCGCCAT TCTTGGGCGA CAAATTTATG CTTGATCGCT TGGGAGTGAG ATACACCCAA
GCACTTCGCG AAGCTTTTCT CGGCAGAGTC CCGGGAGGCT CAGATCTTGT TTGTTACTGG
CTAGAGAAGG CTCGAGCACA GATACTTTCA AATGAGACGT TTGGCGCAGG ATTTGTCGCG
ACCAATTCAA TACGCGGCGG AGCAAATCGC ACCGTTGTCG ACAGAGTCAC GGCTGATCTA
GACATTTTCT GCGCTTGGGC CGACGAAGAC TGGACAATCG AAGGCGCCGA CGTCCGTGTC
TCACTCATTT GCTTTTCCTC GAAAGGCCGG GCCCAATTGC TTGTCGAACT CAACGGTCAG
AGCGTCGCCC GCATATTCTC GGACCTGACA AGCAGCGCAA CCGATTTTAC ACGCGCTCGT
AGCCTGAGAT CGTGCCGTGA GGTTGCGTTT ATCGGCAATC AAAAGGGCGG CGCATTCGAT
CTACCGGGAT CAATCGCCCG TTCATTCCTC ACTCTGCCTC AGAACCCTAA CGGAAACTCG
AATGCGGATG TCGTCAAGCC ATGGATCAAT GGACTCGACA TCGTTCGACG CCCCAGGGAC
TATTGGATCA TTGACTTCAC TGGCTTACAA GAATCCGAGG CTGCTCTTTA CGAGGGGCCA
TTTCAGTACA TTCTGGAGCA CGTTAAGGAG TATCGGAACG AAGAAGCTCA CGAATCGAGC
AAGATGAATT GGTGGATACA TCAGAGGCCA CGCCATGCGC TTCGATTAGC CATAGACGGA
CAGTCACGCT ACTTGGCCAC CGCACGTGTC GCCAAGCACC GGCTATTCAT TTGGGTTGAT
CATCAAGTCG TACCTGACAG TCAGGTTGTA GCTATTGCAC GAAGCGACGA TGCGACCTTT
GGCATTCTAC ACTCTAGCTT CCACGAGTCA TGGACGCTTC GCCTCTGCAC ATGGCTCGGC
GTTGGCAACG ACCCACGCTA TACACCGACC ACTACCTTCG AAACCTTCCC TTTCCCCGAA
GGCTTGACGC CGGACATCCC GGCGGGGGAC TACGCCGACG ACCCGCGCGC GCAGGCGATC
GCGAAGGCGG CGAAGCGGCT CGACGAACTG CGCAAAGCGT GGCTCAATCC GCCCGATCTG
GTCCGGATCG AGCCGGAGGT CGTGCCGGGT TATCCCGACC GCATCCTGCC GAAGGACACA
AAGGCGGCGT CCGAATTGAA GAAGCGGACG TTGACCAATC TTTACAACGC GCGTCCGCAA
TGGCTGGCCG ACGCGCATCG CGATCTCGAT GCCGCGGTCG CTGCGGCCTA TGGCTGGCCC
GCCGACATTA CGGAAGACGA CGCACTGGCG AAGCTGCTGG AGCTGAATCT GTCGCGCGCG
GGCGCGTCGA GCCCGCCTCC GGCCAACAAG GATGAAGGTT AG
 
Protein sequence
MTPAEFIKKW KPVALTERAA AQTHFLDLCK LFEHEDPVSA DPTGEWFTFE KGATKTGGGD 
GFADVWKKNY FAWEYKKKKR DLGVAMNQLV RYAAALENPP LQVVCDTDRF VIRTAWTNTV
PKEYEIELDD LADPEKRKIL WAVFHDPEQL RPQQTRTAIT KEAADKFSTI ALRLQGRGTP
EEIAHFVNQL VFCFFASSVK LLPEGFFPKL LKRAAQKPQH AIDYFNKLFE AMENGGEYDL
TDIAHFNGGL FDGRRALKLD EGDIGLLIEA GSLDWGQIDP TIFGTLFERF LDPDKRAQIG
AHYTDPDKIL MIVEPVILRP LRAEWDAARA KIAEIAGEAN ALQQTGFSKQ GAKSFDKKIT
NIRAKAEVIR DQFIERLRGI TILDPACGSG NFLYLALQGV KDIELRANLE CEALGLSPRL
PVIGPEIVHG LEINELAAEL ARTTIWIGDI QWRIRNGIYS NPRPILRKLD SIECRDALIT
KLTDGTYAEA EWPTAEFIVG NPPFLGDKFM LDRLGVRYTQ ALREAFLGRV PGGSDLVCYW
LEKARAQILS NETFGAGFVA TNSIRGGANR TVVDRVTADL DIFCAWADED WTIEGADVRV
SLICFSSKGR AQLLVELNGQ SVARIFSDLT SSATDFTRAR SLRSCREVAF IGNQKGGAFD
LPGSIARSFL TLPQNPNGNS NADVVKPWIN GLDIVRRPRD YWIIDFTGLQ ESEAALYEGP
FQYILEHVKE YRNEEAHESS KMNWWIHQRP RHALRLAIDG QSRYLATARV AKHRLFIWVD
HQVVPDSQVV AIARSDDATF GILHSSFHES WTLRLCTWLG VGNDPRYTPT TTFETFPFPE
GLTPDIPAGD YADDPRAQAI AKAAKRLDEL RKAWLNPPDL VRIEPEVVPG YPDRILPKDT
KAASELKKRT LTNLYNARPQ WLADAHRDLD AAVAAAYGWP ADITEDDALA KLLELNLSRA
GASSPPPANK DEG
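
The protein sequence can be cross-checked against the gene sequence by translating it with translation table 11, as listed in the Gene Information fields. The sketch below is illustrative only: it builds the codon map from the NCBI amino acid string for table 11, ignores the table's alternative start codons (the gene here begins with ATG, so that does not matter for this record), and uses a hypothetical helper named `translate`. Only the first and last blocks of the gene sequence are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, with codons enumerated in
# T, C, A, G order at each position (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used above) is removed first.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGACCCCCG CCGAATTCAT CAAGAAATGG"))  # MTPAEFIKKW
print(translate("GATGAAGGTT AG"))                     # DEG
```

The two outputs match the first ten and last three residues of the protein sequence; passing the full gene sequence to the same function would be the way to compare the whole record.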