Gene RPC_2039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_2039 
Symbol 
ID3973958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp2221235 
End bp2222446 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content67% 
IMG OID637925148 
Producthypothetical protein 
Protein accessionYP_531913 
Protein GI90423543 
COG category[V] Defense mechanisms 
COG ID[COG0577] ABC-type antimicrobial peptide transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.417724 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTTGC GATCGCTGAG CGCCAACCGG CTGCGCTCGG CGCTGACGAT GGTCGGCATC 
GTCGTCGGCG TCGCGGCGGT GATCGCGGTA ATGGCGGTGG GCGAGGGCGC ACGGGCGGCG
GTCGCGCAAC AAATTCGCGC GCTCGGTGCT AGCCTGATCA TCGTGACCTC CGGCGCCGGC
TTCCAGAGCG GCCTGCGCCT CGGTGCGGGC ACCACGTCTA ACCTGTCGGA AGCCGATGCC
GATGCCATTC AGAACGAGAT TCCTGAGGCC GTCACCGCCT CGCCGTTCCT GCGCACTCAG
GCGCAAGTCC TCGGCAATGG AACCAACACC GCGACCTCGG TGTTCGGCGC GGACAACCGC
TTCCTGACCG CTAGAGAATG GGATGTGGAA ATCGGCCGGC GGTTCGATGC CGAGGAAAGC
CGCAGCGGGG AAACCGTCGC GCTGATCGGG CGGACCGTGG CCGGCTTGCT GTTCCCCGAG
CAGAATCCGA TCGACGAACA GATCATCATC CGCGGCGTTC CGCTGCGGAT CATCGGCGTG
CTCGCGGTCA AAGGCCAGTC GATGGTGGCG CAGGATCAGG ACGACCTGGT GATCGTGCCG
ATCGACGTCG TGCGGCGGCG CATCATTGGC GGCAACCCGA CTGGCGACGG CAGCGTCGGA
GCGATCCTGG TCAAGGCCGA AGACGGCGCG GTGCTGTCCG AAACCAGCCA ATCGGTCCGC
GCCTTGCTGC GGCAACGCCA CCGGCTGGTC TCCGATCAGG AGGACGATTT CCAGCTTCGA
AATCTCACCG AAATCATGAA TGCGGTGGCC TCCAGCGCCA ACGCGGTGTC GTTGCTGCTG
GCCGCCGTTG CGGCGATTTC GCTGTTCGTC GGCGGGGTTG GAATCATGAA CATGATGCTG
GTCGCGGTGA CCGAACGGAT CCCGGAAATA GGACTGCGAC TGGCGATCGG GGCGACACGA
GCCAACATCC TGGCGCAGTT TCTCGCCGAG GCCGGCCTGC TGGCTGCGAC CGGCGGCGCG
GTCGGCGTGG CCATCGGCTG GGGATTGGCG GCGGCGATCG CGGCGATCGC CGCGTGGCCG
ACACTGATCG CTGCGCATCA TGTGCTGGGC GCGCTGTTGT TCTCCGCCCT GGTTGGCCTG
GTGTTCGGGT TCGTGCCGGC GCTGCGGGCC TCCCGGCTCG ATCCGATCGT CGCGCTGAGA
AGCCTGTCAT GA
 
Protein sequence
MALRSLSANR LRSALTMVGI VVGVAAVIAV MAVGEGARAA VAQQIRALGA SLIIVTSGAG 
FQSGLRLGAG TTSNLSEADA DAIQNEIPEA VTASPFLRTQ AQVLGNGTNT ATSVFGADNR
FLTAREWDVE IGRRFDAEES RSGETVALIG RTVAGLLFPE QNPIDEQIII RGVPLRIIGV
LAVKGQSMVA QDQDDLVIVP IDVVRRRIIG GNPTGDGSVG AILVKAEDGA VLSETSQSVR
ALLRQRHRLV SDQEDDFQLR NLTEIMNAVA SSANAVSLLL AAVAAISLFV GGVGIMNMML
VAVTERIPEI GLRLAIGATR ANILAQFLAE AGLLAATGGA VGVAIGWGLA AAIAAIAAWP
TLIAAHHVLG ALLFSALVGL VFGFVPALRA SRLDPIVALR SLS