Gene RPC_4624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4624 
Symbol 
ID3972134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp5165247 
End bp5166617 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content66% 
IMG OID637927735 
Productagarase 
Protein accessionYP_534465 
Protein GI90426095 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTGTG AACAAGGAGA TAGATGGGTG CGACGAACCA AATGGGGCGG CCTTGCCGAT 
GGCTGTTTCG AACCCAGCGG CTTCTTTCGG GTGGAGCAGG ACGATGGCGT ATTCTGGTTC
GTCGATCCCG ATGGCGGCCG GTTTCTGTCC AAGGGCGTCA ACAACGTCCA GTTCGCTCCG
GATCAGATTC GCGGCACCGA TCGGACGCCC TATGCCGAGG CCTGCCTGGC GAAATACGGC
AGCCGCAACG AATGGCGTCG CGCGGCGGCG GAGCGATTGA CCGGCTGGGA TTTCAACACG
CTGGGCTGCT GGTCGGATGA GATCGTCGCC GGCGCAGGCG CCTTGCCGCT GGCCACCGCG
CCGATCGTCG ACCTCGGCGC TTCGTTCTGG CTGCATCGTC ACGGCCAGCG CTTTCCCGAC
GTGTTCGACG CCGAATTCGA CAATCACATC CGGCAACGCG CCAAGGACCT GTGCACGCCG
CGCCGCAATG CTCCGGAATT GCTCGGCACC TTCATCGACA ACGAGCTGTA CTGGTCGCCT
GACTGGCGCG GCCACGACGA GTTGCTGACC ACGTTCTTGA ATTTTCCGCC CGGACGCGCC
GGCCGCGTCA CCGCGATCAC CGCGCTGCAG CAGCACTATG GCGAGTTCGC CCACTTCAAC
GTGATCTGGC ACACGCCGGT GCGGTCCTGG GAGGCACTGC ATGCGCTCGA GACCATCGCG
GCGCCGTTTG TGCGCGCGGC GCCGGGCGGC GATTATGCCG TGCTCGAAGC GGAGGCCAAC
CGCAACCCGC GGCGCGCGGC GTTCGCCGCC GATTGCGACG CCTTTGCCGC GGTGGTGGCC
GACCGCTATT TCGAACTCTG CACGGCGGCG ATCAAGGCCG CCGATCCCAA CCATCTGGTG
CTCGGCGCAA GGCTCGGCGC GCTGCCGCAC GACGGCGTGG TCGCCGCCGC CGGCCGCCAT
CTCGACGTGA TTTCGTTCAA TTGCTACGGC TTCGACCCAT CGGCCTTGCT CGACGCCTAT
GCGGTGACCG GCAAGCCCTG CCTGATCACG GAGTTTTCGT TTCGCGGCGA CGATGCAGGC
CTGCCGAACA GTTGCGGCGG CGGTCCGCGG GTCGCCACCC AGGCCGACCG CGCCCACGCC
TTCGAGCGCT ATGTCGCCGC GGCGCTGATC AAGCCGAATC TGGTCGGCTA CCACTGGTTC
GAGCATGCCG ATCAGCCGGC CGAAGGCCGC TTCGACGGCG AGGACTGCAA TTACGGCACG
GTGACGATCA AAGACGAGGT CTATCCGGAA CTCACGGCAT CGATGAGCCG GTTGAATGCG
GCGGCGGAGA GCATCCATCG CAAGGCCGTG GCGGCGCGAC CGGCGGCTTG A
 
Protein sequence
MRCEQGDRWV RRTKWGGLAD GCFEPSGFFR VEQDDGVFWF VDPDGGRFLS KGVNNVQFAP 
DQIRGTDRTP YAEACLAKYG SRNEWRRAAA ERLTGWDFNT LGCWSDEIVA GAGALPLATA
PIVDLGASFW LHRHGQRFPD VFDAEFDNHI RQRAKDLCTP RRNAPELLGT FIDNELYWSP
DWRGHDELLT TFLNFPPGRA GRVTAITALQ QHYGEFAHFN VIWHTPVRSW EALHALETIA
APFVRAAPGG DYAVLEAEAN RNPRRAAFAA DCDAFAAVVA DRYFELCTAA IKAADPNHLV
LGARLGALPH DGVVAAAGRH LDVISFNCYG FDPSALLDAY AVTGKPCLIT EFSFRGDDAG
LPNSCGGGPR VATQADRAHA FERYVAAALI KPNLVGYHWF EHADQPAEGR FDGEDCNYGT
VTIKDEVYPE LTASMSRLNA AAESIHRKAV AARPAA