Gene Rpal_2203 details


Gene Information

Locus tag: Rpal_2203
Symbol:
ID: 6409863
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 2388328
End bp: 2389377
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 62%
IMG OID: 642712087
Product: hypothetical protein
Protein accession: YP_001991199
Protein GI: 192290594
COG category: [C] Energy production and conversion
COG ID: [COG4313] Protein involved in meta-pathway of phenol degradation
TIGRFAM ID:
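
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon (the function names are hypothetical, chosen for illustration):

```python
# Coordinates copied from the record above.
START_BP = 2388328
END_BP = 2389377


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Length in aa, assuming one terminal stop codon that is not translated."""
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1050
print(protein_length(length_bp))  # 349
```

Both results agree with the Gene Length and Protein Length fields of the record.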


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAAG TGAATTCGGT GGCGCTGCCT GCGCTGTTGC TCGGGGCGGT CGTCTGTCTC 
GGCGGGACGG CCGCTCGTTC GGACGAAGCG GGCGTGAGCA TGTGGCTGCC CGGCACATTC
GGATCGATGG CTGCGGTGCC GACCGCTCCG GGATGGTCCG CGGCCGGCGT CTACTACCAT
ACGTCGCTCA GCGCGGGTCG CGAGGTCGCG ACCGCACGGG AAGTCCAGAT CGGGCGGTTC
ACCCCGAACT TGAATGTCGG GCTTTCGGCC GATCTCAATG CGACCGGCGA TCTCTTCCTG
CTGGCCCCGA CCTACACCTT TGCCACGCCG GTGCTGGGTG GGCAGGCTTC GGTGGGATTC
ACGAGCGTGA TGGGGCGCGC GAGTGCCGGA CTGAATGCGA CGCTGTCGGC AACGATCCCG
CCGTTCAACG TGATCCGTTC GGATTCGATC AACGATTCAG TGTCGGGCGT CGGTGACCTG
TATCCGGTCG CCAAGCTGAA GTGGAATCAA GGCGTCAATA ATTGGATGAT CTATGCGACC
GGCGATATTC CCGTCGGTGC GTATAACCGC AGCCGCATCA TCAATCTCGG CATCGGCCAC
GGCGCGATCG ATCTCGGCGG CGGCTATACG TACTTCAATC CGAAGGCGGG GTCGGAGTTC
TCCGTGGTGA TGGGCTTCAC CGAGAATTTC AAGAATACGT CGACGGACTA CAAGAACGGT
CTCGATTTCC ACCTCGATTG GGCCGTGTCG CAATTCGTAT CGAAGCAGTT TTTCGTCGGC
GCGGTCGGCT ATGCGTACAA TCAGCTCACC GGCGACAGCG GAACGGGTGC GAAACTCGGC
CCGTTCAAAT CGCGTGTCGA AGCGGTCGGT CCCCAGATTG GCTACCTGTT CCCGGTCGGC
GACATGCAGG GCGTCCTGAA TTTGAAGGGC TATTGGGAGT TCGACGCGCA GAACCGGGCG
AAGGGTTGGA ACACCTGGTT GACATTCGCC GTCTCCGCCC CGCCGCCGCC GCCGCCGCCA
CCGGCCGGTG CGAGCCTTCC AACCAAATAG
 
Protein sequence
MNKVNSVALP ALLLGAVVCL GGTAARSDEA GVSMWLPGTF GSMAAVPTAP GWSAAGVYYH 
TSLSAGREVA TAREVQIGRF TPNLNVGLSA DLNATGDLFL LAPTYTFATP VLGGQASVGF
TSVMGRASAG LNATLSATIP PFNVIRSDSI NDSVSGVGDL YPVAKLKWNQ GVNNWMIYAT
GDIPVGAYNR SRIINLGIGH GAIDLGGGYT YFNPKAGSEF SVVMGFTENF KNTSTDYKNG
LDFHLDWAVS QFVSKQFFVG AVGYAYNQLT GDSGTGAKLG PFKSRVEAVG PQIGYLFPVG
DMQGVLNLKG YWEFDAQNRA KGWNTWLTFA VSAPPPPPPP PAGASLPTK
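
The gene and protein sequences can be checked against each other by translating the gene sequence. The sketch below is one hedged way to do that in plain Python, together with a GC percentage helper. It assumes the sequence uses only the letters A, C, G and T, and that it starts with ATG as it does here; translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may initiate translation, which this sketch does not handle. The helper names (clean, translate, gc_percent) are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping shared by the standard code and table 11,
# listed with bases in T, C, A, G order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 18 bases and last 12 bases of the gene sequence above.
print(translate("ATGAACAAAG TGAATTCG"))  # MNKVNS
print(translate("CCAACCAAATAG"))         # PTK
```

Passing the full gene sequence to translate should reproduce the protein sequence above, and gc_percent on the same input can be compared with the GC content field.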