Gene Rpal_3346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_3346 
Symbol 
ID6411019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp3604532 
End bp3606580 
Gene Length2049 bp 
Protein Length682 aa 
Translation table11 
GC content64% 
IMG OID642713225 
Producthypothetical protein 
Protein accessionYP_001992323 
Protein GI192291718 
COG category[S] Function unknown 
COG ID[COG5281] Phage-related minor tail protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.677729 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGGTCT CTCTGGTCAT CGACGGCGAC AGCAACGGCG CGGTGGAAGC CGCGACCGAT 
GCGTCGAAAG CCGTCACCGA CCTGGGAACG ACCGCGACCG ATACCGCCAA GGGCCTTGAA
GACGGCTTCA AGAAAGCCAG CGGCGCCGCC GGATCGATGA AGGGCAGCGC CGAGGCGGCG
GGCGCGGCGA ATGACAACCT GCTGGCATCG ACTTCCGCGC TGTTATCGAA GTTCGGAGAA
CTGGCTTCGA AGGCCAAGGG ATCGAACGAC GCGCTCGCGC TGAACGCCAG CACGGCGAGC
AGCCTGACGG CGAGCCTGGG CGCGCTGTCG CGTTCGGTCG GCGTCGTCGG ATTCCTGACC
GGCGCGATTG GCCTCGCGAC CGCAGCGCTC TCCGCCTACA ACGAGCTGTC GGCGCGCGCC
AGCCGCGAGA TCGACCGGCA GCTCACCGAG CAAGCCCGCC TGATAAACGT GGTGCGCGGC
GCTTACCAGA ATGCCAAGAC GGCGGCAGGC GATTTTTACG AGCAGAGCAA AGATGTGACG
CTGCTGCAGC TGCGGCAGAA TCAGGTCGAT CTGCAGAGGA CGCTACAAAC GCAGGTCGGT
TCCTTTATCG GAGGTGTTAC CAGTTTCGGC GATAGCCTTG GAAACTACTT CAATAACGTT
AAGACAATTA GGGGAGAACT CCTACCGTTT GAAGATGCCA TCTTTAGGCT CAATGATGGC
TTCAAACAGG GCAAGCCCGA CGTCGAAGGG TTTGTCAACG AAATTGCGCG GATCGCGTTA
CTAGATCCCT CTCTACAAAA AGCTGCGTCA GATCTAATTA CCAAGATCAA CGAAGCGTCC
AAGACCGCTC GGGCGCTGAA GGACCTAAGT TCTGGCGAGA AAGTCATAGA AGGAAAGGAC
AAGCCGGAGG ATCGCAAACG CGTCGGCATC ACCGACCAGG TTTCCGACAC CGCCGGTCAG
TTCGAGCGCC TTGCCAAGTC GATGGACCGG CAGGCCGCGA GTGCCGAGGC CGAAGCCTCC
GCGACTGGCA AGAGCGCCGG CGAGATCGCC AAGCTGCGCG TCGAGGCGGT GCTGACCGAA
GCCGCGCAGC AAGCCGGCGG CGGGACGGCC GAGAAGTACG CAGAGCGCAT CAAACAGATC
GGAGATCGCG CCGGCGAAGC TGCGCAGAAG CTGGCGCTGG CTCGTGTGCA ATCGGACGCG
GCGTTCCAGC GCCAGACGCT GGGGCTATCG GCCGACGACG CCCAGATCGC CGAGCAGCTG
CGCGGTGCCT ATGGGGACAA CGTTCAACGG GCGTTGAGCA GCGCGGACGC CGCGGCACTG
AAGTTCAACC AGGACATGCT GCAGCTGAAG AACACCACGC TCGACGTCAG CAAGGGGGTT
TTCACCGATT TCCGCACCTC GATCCAGCAA GGCGCGACCG CCATGGAGGC GCTGGGCAAT
GCTGGCGTCA ATGCGATCGG CAGGATCGCC GACAAGATCG CAAGCACTCA GCTCGACAAC
CTGGTCTCCG GACTGTTCGG CGCGTTCACG GGGGGCAGCG GCGGCGGCCT GCTGTCGTCG
CTATTCGGCG GTGGTAGCAA GGGCAGCTTC GCCACGGACG GCATTGGTGG TTTCGGTCCG
ACCTTCACGG CGGCCGACGG CGGCACCTTC GGCCCCGGCT GGGGCGTGGT CGGCGAGCGC
GGCGCCGAGA TCATCAAGGT GCACGCCGGC GGCGTCACGG TGTTTCCGCA TCACGTCAGC
AAGCCGTATC TGCCGGGCTT CGCCGATGGC GGCTCGCTCG ATCAGCTCGG CAACATCGCG
CGACTGCCGA GCTTTCAAAG TTCTTCGTCC GGCGGATCGG CCGGGAGCGC CGCACCGCAG
GAGTTCCGTC TCTCCGTCAG CGTCGACGAC GACGGCAAAC TGAAAACCGT GGTGACGGAC
GTCGCGAAGG GAATTGCTCA GGAGGTCTCT GCGGGCACCG TGAAGGACTT CGTGCGCAGC
CCGCAATTCA CCGATCACGT CGCACGTGCA TACGGCGACG CCAAGGCCGC GTGGAAGATC
AGATCATGA
 
Protein sequence
MRVSLVIDGD SNGAVEAATD ASKAVTDLGT TATDTAKGLE DGFKKASGAA GSMKGSAEAA 
GAANDNLLAS TSALLSKFGE LASKAKGSND ALALNASTAS SLTASLGALS RSVGVVGFLT
GAIGLATAAL SAYNELSARA SREIDRQLTE QARLINVVRG AYQNAKTAAG DFYEQSKDVT
LLQLRQNQVD LQRTLQTQVG SFIGGVTSFG DSLGNYFNNV KTIRGELLPF EDAIFRLNDG
FKQGKPDVEG FVNEIARIAL LDPSLQKAAS DLITKINEAS KTARALKDLS SGEKVIEGKD
KPEDRKRVGI TDQVSDTAGQ FERLAKSMDR QAASAEAEAS ATGKSAGEIA KLRVEAVLTE
AAQQAGGGTA EKYAERIKQI GDRAGEAAQK LALARVQSDA AFQRQTLGLS ADDAQIAEQL
RGAYGDNVQR ALSSADAAAL KFNQDMLQLK NTTLDVSKGV FTDFRTSIQQ GATAMEALGN
AGVNAIGRIA DKIASTQLDN LVSGLFGAFT GGSGGGLLSS LFGGGSKGSF ATDGIGGFGP
TFTAADGGTF GPGWGVVGER GAEIIKVHAG GVTVFPHHVS KPYLPGFADG GSLDQLGNIA
RLPSFQSSSS GGSAGSAAPQ EFRLSVSVDD DGKLKTVVTD VAKGIAQEVS AGTVKDFVRS
PQFTDHVARA YGDAKAAWKI RS