Gene RPC_3629 details


Gene Information

Locus tag: RPC_3629
Symbol:
ID: 3970644
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4035973
End bp: 4037307
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 65%
IMG OID: 637926737
Product: carbohydrate-selective porin OprB
Protein accession: YP_533483
Protein GI: 90425113
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3659] Carbohydrate-selective porin
TIGRFAM ID:
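
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4035973
end_bp = 4037307
gene_length_bp = 1335
protein_length_aa = 444

# Assumption: coordinates are inclusive, so length = end - start + 1.
computed_gene_length = end_bp - start_bp + 1

# Assumption: one codon (3 bp) per residue, with the final codon being
# a stop codon that is not counted in the protein length.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length, computed_protein_length)  # 1335 444
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.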


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.419401
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00176786
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGCTACA CGCGACATAG AAATTTCACC ACGCCGCGCG CTTGTGCCAC GGCATTGTTG 
GCGACCGGTT TGCTGGCCGG CGGCCTGGTC ACCGCCAGCG CGCAAGAGAA GAGCCTGGAA
GAGCGCGACA AGCTCACCGG CACCTGGGGC GGCGCCCGCA CCGCGCTGGA AGACAAGGGC
ATCGAGATCG GCGTTGTCTA TATCGGCGAA GTGCTCGGCA TCTCGGGCGG CGCCAAGCCC
GCCGGCGGCA CCCATGCCAC CTATGAGGGC CGTCTCGACG TCACCATCAA CACCGACCTG
GAGAAGCTGG TCGGCTGGGC CGGCGCCAAG ACCCATGTCC GCGCCTTCCA GATCCACAGC
GCGCAGGGCC AGAACGCCGC CAACTATGTC GGCTCGATCG CCGATCCCAG CAACATCGAT
GCCTACGGCA CCACCCGGCT GTTCACCGCC TGGTTCCAGC AGGAGTTCGG TACTTGGGGC
TCGATCCGCC TCGGCCAACT CGCCGGCGAC GACGAATTTC TGGTCAGCAC CACCGCGGGC
GGCCTGATCA ACGGCACCTT CGGCTGGGCC GCGATCATGG CGGCGAACCT TCCGAGCGGT
GGCCCGGCGT ATCCGTTGGC CACGCCTGGC GTGCGGCTGC AGGTCAATCC GACCGAGAAC
ATCTCGCTGC TCGGCGCGGT GTTCGCCGGC GATCCGGCGG GCAAGAATTG CACCAGCGGC
AACCAGCAGC GCGATTGCAA CCGTTTCGGC ACCACTTTCA GTCTCGACGG CGGCGCGTTC
TGGCTCGGCG AGGCGCAGTA CAATTTCAAC CAGGACAAGG ATGCCACCGG GTTGGCCGGC
TCCTATAAAG TCGGTGCCTG GTATCACACC GGCGATCGCT TCCTTGATCA ATACTATCAG
AGCAATCGCA GCACCGACTG GGGCATGTAC GGCGTGGTCG ACCAGATGCT GTGGCGCGGC
AAGGACGCCA GCACCAGCAT CTTTGTCCGC GGCGGCTGGA CGCCGTCCGA TCGCAATGTG
GTTTCTTGGT ACATCGACGG CGGCGTCGGC TTCAAAGGCT TCGTCCCGGG GCGCGAGGCC
GACACTCTGA CCATCGGTGT GGCGCATTCC AAAATCAGCA GGGAGGCGGC TGCTTACAGC
TTCGACAACT CCGCTTTGCG GCGTACCGGC GAAACCGTGC TCGAGGTCAG CTACATCGCC
CAGGTCAATC CGTGGTGGAC CGTGCAGCCG GACTTCCAAT ACATCGCCAA GCCGGCGGGC
GGCGCACTCC GCGACGACGG CTCGGTGGTC GACGACGCCT ATGTGTTCGG CGTCCGGACC
ACGATCACGT TCTGA
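
The GC content field can in principle be recomputed from the gene sequence listed above. The following is a minimal sketch, assuming GC content is the percentage of G and C bases over the whole sequence; `gc_percent` is a hypothetical helper, not part of any database tooling. It ignores the spaces and line breaks used in the listing.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces and line breaks used for layout) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example on the first row of the gene sequence only; paste the full
# listing to compare against the reported GC content field.
first_row = "ATGAGCTACA CGCGACATAG AAATTTCACC ACGCCGCGCG CTTGTGCCAC GGCATTGTTG"
print(f"{gc_percent(first_row):.1f}%")
```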
 
Protein sequence
MSYTRHRNFT TPRACATALL ATGLLAGGLV TASAQEKSLE ERDKLTGTWG GARTALEDKG 
IEIGVVYIGE VLGISGGAKP AGGTHATYEG RLDVTINTDL EKLVGWAGAK THVRAFQIHS
AQGQNAANYV GSIADPSNID AYGTTRLFTA WFQQEFGTWG SIRLGQLAGD DEFLVSTTAG
GLINGTFGWA AIMAANLPSG GPAYPLATPG VRLQVNPTEN ISLLGAVFAG DPAGKNCTSG
NQQRDCNRFG TTFSLDGGAF WLGEAQYNFN QDKDATGLAG SYKVGAWYHT GDRFLDQYYQ
SNRSTDWGMY GVVDQMLWRG KDASTSIFVR GGWTPSDRNV VSWYIDGGVG FKGFVPGREA
DTLTIGVAHS KISREAAAYS FDNSALRRTG ETVLEVSYIA QVNPWWTVQP DFQYIAKPAG
GALRDDGSVV DDAYVFGVRT TITF
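
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration rather than the database's own method: it builds the codon table from the standard NCBI amino acid ordering, which translation table 11 shares with the standard code (table 11 differs only in its permitted start codons). The `translate` helper is hypothetical, and it assumes the sequence is given 5' to 3' in the coding frame.

```python
from itertools import product

# NCBI ordering: first base varies slowest, third base fastest, over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace used for layout is ignored. Alternative start codons
    (allowed by table 11) are not handled specially; this gene starts
    with ATG, so that does not matter here.
    """
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAGCTACA CGCGACATAG AAATTTCACC"))  # MSYTRHRNFT
```

Applied to the last row of the gene sequence (`ACGATCACGT TCTGA`), the same function yields `TITF`, matching the end of the protein sequence.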