Gene RPD_4300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4300 
Symbol 
ID4024824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4763521 
End bp4766484 
Gene Length2964 bp 
Protein Length987 aa 
Translation table11 
GC content68% 
IMG OID637964509 
ProductOuter membrane autotransporter barrel 
Protein accessionYP_571418 
Protein GI91978759 
COG category[S] Function unknown 
COG ID[COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.359518 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTTCA GAGCGGATGA TCGGACGATC AGGCGGTTGG CGGCCCCTTG CGGATCGGAT 
GCCGACGTTC CTTCCGGCAA GCGCATCCGC CGCGTCGTGC TCGGGACAAC AGCGCTTGTC
GGCTTTTCGA TCTCCGTCGC GGCGGGGCTT CCCGGAATCA TTCTCCTGAG TTCACTCGCC
GCGACCTCGG CGCTTGCCGA TGGTGGGGCT GGCAGCGCTG GCTATGCATA CGGCGGTGTT
GGGGGGGCCG ATAGCGCCCC GGGCATTGGC GGTCTTGGTG GCAATGCGCC AGTCGCCCCC
GGAGGCGGCA GCGGTGGCGG TGGTGGCGCG GGCGCTACGG GTGGAGCAGG CGGTAACGGC
GACGCCTGGA CAGGCGTCGC GGGTCTTGGA GGCGCGGGAG GTGTCGCGGC CGGCTCCGCT
GGGCAGGTCG GAGGCAATGG CACCCCATCC GGTGGTCAGG CGGGCGGAGG CGGAGGCGGA
GGCGGCGCTC ATGGGTACGT GGGCGCGACG TTACCGGTAG TCCTCTCAAT CGGCGGAAAT
GGCGGCAACG GGGGAGTTGG GACCAACGTT TCTTCCTACG GCGATGGCGG CGGCGGTGGC
GCCGGCGGGT ATGGCGCCGT TGTCACCGGC GGCGGCGGCG GCACGCTGAC TTCTACGGTG
ACCGGCGGAA ACGGCGGAAA CGGCGGACTG GGCTTTCTCT GGGGAGACGG CGGAACGGGC
GGCATCGGCC TGCACTACTC CGGTGGCGCC TCATTCACCG CAAACGCGGT CGTAAGTGGT
GGCCAAGGCG GCACCGGTGC GAACGGCGGC GCTGGCGGCG CTGGTATCTA TATCGCGGCC
GGAAGTACGG GCATATTCAC TGCAAACGCG GTCATCACTG GCGGCAACGG TGGCAATGGC
GGCGTCACCT CCGGTGTTGG AGGTGTTGGC GTCTATCTTG CTGCCGGAAG TACAATTATC
AATGCCGGCA CAATCAGCGG GGGCCTCTCC GGCGATGGCG TAACTCGCGC CAACGCGATC
ACCTTTGCCG GCGGGATCAA TATCCTCGAG CTGCAGGCCG GCTCGCACAT CATCGGGAAC
GTCGTGGCCT TCGGCGGCGC CGACACGCTG CGGCTCGGCG GCGCCGGCAA TGCCAGCTTC
GACGTCGCGC AGATCGGCGC CGCCGCTCAG TATCAGAATT TCGGCGTGTT CGAGAAAACC
GGCGTGAGCG TCTGGTCCCT GACCGGCAAC ACCAGCGAGG CGATGCCCTG GGCGGTGAAT
GCCGGCACTC TCGCGGTGAA CGCCGCGATG GCGAATGCGA CCATGACCGT GAACAATGGC
GGCACCCTTG CCGGCGTGGG GACCGTGGGC GCGGTGACCG TGGCCAATGG CGGCACGCTC
GCCCCCGGCA ATTCGGTCGG GACCATCACG GTCTCGGGCA ATCTGACCTT CGGCGCCGGC
GGCTTCTATG CCGTCGACGT CTCGCCGTCG GCCGCCGATC GCACCAATGT CACCGGCACC
GCGACGCTCG GGGGCGCCAC GGTGAACGCG ATCTTCACGG GCGGAGGCTA TGTCGAGCGG
CAGTACACCA TCGTCAATGC CGCGGGCGGC ATCGTCGGCA GTTTCAGCAC GCTTGCGATG
AGCAATCTGC CGTCGGGCTT CAAGTCCAGC CTCGGCTATG ACCTTCACAA CGCCTATCTC
AATTTGGTGC TCGATCTGAC ACCGACACCT ACGCCCACAC CCACGCCTTC ACCTACACCT
GGCCCTGCAC CCACCCCCGC ACCCGCCCCG GCGCCGCTGC CGATCAACAG CGGCCTCAAC
GTCAACCAGA CCCGGACCGC CAATGCGCTG AGCGATTACT TCGCCCGGGT CGGACGGATT
CCGATCGTGT TCGGCGCGTT GACGCCGACC GGGCTCGGCA TGGTGGCGGG CGAGGTTGCC
ACGGCGACGC AGCAAACCAC CTTCGACGCC ATGGACCTGT TCTTGGGATT GCTGACCGAT
CCGTTCTCGG CGGGACGTAT GCCCGACATG CCGGGCGGCG CGTCGCACTT CGCCGACGCC
TCGGCCGGAC GAGGCCCCGC GCGCGATGCC TATGCGATGA TCACCAAGGC GAGTTGGCGC
GCGCCGCTGG AAGCCCGCTG GAATGTCTGG GCCGCCGGCT TTGGCGGCTC GCAGACCACC
GGCGGCAACG CCACGCTCGG GAGCAATACG GCGACCAGCC GCATCTATGG AACGGCCGTT
GGGGCTGACT ACTGGCTGTC GCCCTACACC GTGGCCGGCT TCGCGCTGGC GGGCGGCGGC
ACCAGTTTCG GTCTCGCCAA TGGGCTGGGC GCGGGCAACT CGGATTTGTT TCAGGCCGGC
GGGTTTGTTC GTCACACCGT GGGCGCGACC TATCTGACGG CGGCCGCCGC CTATGGCTGG
CAGGACATCA CCACCGATCG CATGGTCGGC ATCGCCGGAT TGAATCAGTT CCGCGCGCAT
TTCAACGCCA ACGCGTTTTC CGGACGGCTC GAGGGCGGCC ATCGCATCGT CGCGCCGCTG
TTCGGCGGCG TCGGGTTGAC GCCCTACGCC GCCATCCAGG TCACGGCGTT CGAGCTGCCG
GCCTATGCGG AGCAGGTCCT GGCCGGCGTC GACACCTTCG CGTTGAGTTA CGGCGCCAAG
GTCGTCACCG CTACGCGCAG CGAGCTTGGG CTGCGCTCCG ACATATCCTT CGCGCTGACC
GATGCGACCC TGACGCTGCG CGGTCGCGCC GCCTGGGCGC ACGATTTCAA CACCGAACGG
TCGGCGCTGG CGACCTTCCA GGTGCTGCCG GGCGCATCCT TCCTGGTCAA CGGCGCCGCG
CAGGCCCGCG ACGCCGCACG CGTCACCGCA TCCGCGGAAA CCAAATGGCT GAACGGCTGG
TCCGTCGCCG GCGCCTTCGA AGGCGAGTTC TCCAACGTCA CGCAGAGCTA CGCCGGCAAG
GGCGTGGTGC GATACGCGTG GTGA
 
Protein sequence
MTFRADDRTI RRLAAPCGSD ADVPSGKRIR RVVLGTTALV GFSISVAAGL PGIILLSSLA 
ATSALADGGA GSAGYAYGGV GGADSAPGIG GLGGNAPVAP GGGSGGGGGA GATGGAGGNG
DAWTGVAGLG GAGGVAAGSA GQVGGNGTPS GGQAGGGGGG GGAHGYVGAT LPVVLSIGGN
GGNGGVGTNV SSYGDGGGGG AGGYGAVVTG GGGGTLTSTV TGGNGGNGGL GFLWGDGGTG
GIGLHYSGGA SFTANAVVSG GQGGTGANGG AGGAGIYIAA GSTGIFTANA VITGGNGGNG
GVTSGVGGVG VYLAAGSTII NAGTISGGLS GDGVTRANAI TFAGGINILE LQAGSHIIGN
VVAFGGADTL RLGGAGNASF DVAQIGAAAQ YQNFGVFEKT GVSVWSLTGN TSEAMPWAVN
AGTLAVNAAM ANATMTVNNG GTLAGVGTVG AVTVANGGTL APGNSVGTIT VSGNLTFGAG
GFYAVDVSPS AADRTNVTGT ATLGGATVNA IFTGGGYVER QYTIVNAAGG IVGSFSTLAM
SNLPSGFKSS LGYDLHNAYL NLVLDLTPTP TPTPTPSPTP GPAPTPAPAP APLPINSGLN
VNQTRTANAL SDYFARVGRI PIVFGALTPT GLGMVAGEVA TATQQTTFDA MDLFLGLLTD
PFSAGRMPDM PGGASHFADA SAGRGPARDA YAMITKASWR APLEARWNVW AAGFGGSQTT
GGNATLGSNT ATSRIYGTAV GADYWLSPYT VAGFALAGGG TSFGLANGLG AGNSDLFQAG
GFVRHTVGAT YLTAAAAYGW QDITTDRMVG IAGLNQFRAH FNANAFSGRL EGGHRIVAPL
FGGVGLTPYA AIQVTAFELP AYAEQVLAGV DTFALSYGAK VVTATRSELG LRSDISFALT
DATLTLRGRA AWAHDFNTER SALATFQVLP GASFLVNGAA QARDAARVTA SAETKWLNGW
SVAGAFEGEF SNVTQSYAGK GVVRYAW