Gene RPB_2662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2662 
Symbol 
ID3910455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3045565 
End bp3046767 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content67% 
IMG OID637884562 
Producthypothetical protein 
Protein accessionYP_486275 
Protein GI86749779 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5653] Protein involved in cellulose biosynthesis (CelD) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.752873 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.243579 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATGA TGGCCGTGCT CAGCGAAGGC CGTTCCGCGG AACACGCGTC GTCGTCGGTG 
CGTCCCGGTC GCATCGCACA TGTCGAGATT TTCCGCGACA TGGCCTCGAC GGAAGCGATC
TGGCGCGCGC TGGAACAGCC CGAACAATTC TCCACGCCGT ATCAGAGGTT CGACTTGCTC
GACGCGTGGC AGCGCCATGT CGGCCGCGCC GATCATGTCG AACCCTTCAT CGTGGTGGCC
AGCGACGCCG AGCAGCGGCC CTTGCTGCTG CTGCCGCTCG GCCTGGAGCG GCGCTTCGGC
GTTCGGATCG CGCGCTTCCT CGGCGGCAAG CACACGACGT TCAACATGCC GCTGTGGCGC
AGCGATGTCG CGCGGACCGC GGATGCGAAC GACCTCGCCG CCCTTGTCGC AGGCCTGCGG
GCGCGCCCGG ACGGCGCCGA CGTGCTGGCG CTGTCTCAGC AGCCGCTTCG CTGGCGCGAC
CTCGCCAACC CGATGGCGCA GCTGCCGCAT CAGCCCTCGA TCAACGATTG TCCGGTGCTG
CTGGTCGATC CTGCCGCGCC GCCGACCGAC CGGATCAGCA ACTCGTTCCG CCGCCGGCTC
AAGACCAAGG AGAAGAAGCT CCAGACATTG CCCGGCTATC GCTACGTCCA GGCCAGGAGC
GACGCCGACG TCGAACGCGT GCTCGATGCC TTCTTTCGGA TCAAGCCGAT CCGCATGGCG
GCGCAGAAGC TGCCGAACGT GTTCGCCGAC CCGGGCGTCG CGGATTTCAT CCGCCAGGCC
TGCATGACCG AGCTCCGGGG AGGCGGCCGG GCGATCGAGA TCCACGCGCT CGAATCCGAC
GACGAGACGA TCGCGATGTT CGCCGGCGTG GCCGACGGCC ATCGCTACTC GATGATGTTC
AACACCTATA CGCTGTCGGA GGCGTCGCGC TACAGTCCCG GCCTGATCCT GATGCGCTCG
ATCATCGATC ACTACGCCGC GCAGGGCTAT CGCCGGCTCG ATCTCGGCAT CGGCTCCGAC
GACTACAAGA AACTGTTCTG CAAGGACCTC GACCCGATCT TCGACAGTTT CATCGCGCTG
TCGCCGCGCG GCCGTCCGGC CGCCGCAGCG ATGGCATCGA TCGCTCGCGC CAAACGCGTC
GTCAAGCAGA CCCCTGCCCT GATGCAGATC GCGCAACGGC TGCGCAGCGC GCTGCATCGC
TGA
 
Protein sequence
MTMMAVLSEG RSAEHASSSV RPGRIAHVEI FRDMASTEAI WRALEQPEQF STPYQRFDLL 
DAWQRHVGRA DHVEPFIVVA SDAEQRPLLL LPLGLERRFG VRIARFLGGK HTTFNMPLWR
SDVARTADAN DLAALVAGLR ARPDGADVLA LSQQPLRWRD LANPMAQLPH QPSINDCPVL
LVDPAAPPTD RISNSFRRRL KTKEKKLQTL PGYRYVQARS DADVERVLDA FFRIKPIRMA
AQKLPNVFAD PGVADFIRQA CMTELRGGGR AIEIHALESD DETIAMFAGV ADGHRYSMMF
NTYTLSEASR YSPGLILMRS IIDHYAAQGY RRLDLGIGSD DYKKLFCKDL DPIFDSFIAL
SPRGRPAAAA MASIARAKRV VKQTPALMQI AQRLRSALHR