Gene RPC_4850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4850 
Symbol 
ID3973593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp5411215 
End bp5412243 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content66% 
IMG OID637927962 
ProductNUDIX hydrolase 
Protein accessionYP_534691 
Protein GI90426321 
COG category[L] Replication, recombination and repair 
COG ID[COG2816] NTP pyrophosphohydrolases containing a Zn-finger, probably nucleic-acid-binding 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.716993 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.684444 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGACCAA CCGCCGCAGT CACCTTCCCG GAAATTGCGA GCCTCGATCG CCACGGCGGC 
GCCAGCCTCG CGGACCAGCG CATGACAACA AACGATTCGT TTCCGCTCGG CCAGCCGGCC
TTCGTCACGC ATGTGCTGGA TCGCGCCGCG CATCTGCGCA GCGACGACGA CAAGTTGTTC
AAGCTGGAAA GCGGCCGCGA CGCCCGCGCC TATGTGGTGC ATCGCGATTC GCTGGTGATG
GCCAAGCAGG CCGACGGCGT CCGCGCGCTG TTGACGATCG ACGAGGCTTT GACGTTCGGG
GCCAATTCCG GAACCATCTT TCTCGGCTTG CGCGATGGCG CGCCACTGTT CGGCATGGGG
ATCGCGGCTG ACGCCGTAGA GCGGTTACTG ATCCGCAACG ACGTCGCGGT GAGCGAGCTG
CGCGGCATGG CGATGGAGGG CGCGGTGCCG GCGGGAGAAC TCTCAGCGAT CGCGATGGCG
AAATCGATGG TCAGCTGGCA TCAGCGCCAC GGCTATTGCG CCAATTGCGG CGCCCGCACC
GTGATGTCGC AAGGCGGCTG GAAGCGCGAT TGCCCGAGCT GCAAGGCCGA GCATTTCCCG
CGCACCGATC CGGTGGTGAT CATGCTGGTA ACGTTCGGCG ACAAATGCCT GCTCGGCCGG
CAGAAGCAGT TTCCGCACGG GATGTATTCG TGCCTCGCCG GCTTCGTCGA AGCCGCGGAA
ACCTTCGAGG ACGCGGTGCG CCGCGAGGTG TTCGAGGAAT CCGGGATCCG CTGCGGCGAC
GTCGCCTATT ACATGACGCA GCCCTGGCCC TATCCGTCGT CGCTGATGAT CGGCTGCTCG
GCGCAGGCGA CCACCGAGGA TATCGTGGTC GACCACACCG AACTCGAAGA CGCCCGCTGG
TTTTCCCGCG ACGAGGCGAT GCTGATGCAT CACCGGCGGC ATCCCGACGG GCTGACCGGC
GCGCATTCGT TCGCGATCGC CCACCACCTG CTCGGCCGCT GGCTGCACGG CCCGTCTTCA
GCGACATGA
 
Protein sequence
MRPTAAVTFP EIASLDRHGG ASLADQRMTT NDSFPLGQPA FVTHVLDRAA HLRSDDDKLF 
KLESGRDARA YVVHRDSLVM AKQADGVRAL LTIDEALTFG ANSGTIFLGL RDGAPLFGMG
IAADAVERLL IRNDVAVSEL RGMAMEGAVP AGELSAIAMA KSMVSWHQRH GYCANCGART
VMSQGGWKRD CPSCKAEHFP RTDPVVIMLV TFGDKCLLGR QKQFPHGMYS CLAGFVEAAE
TFEDAVRREV FEESGIRCGD VAYYMTQPWP YPSSLMIGCS AQATTEDIVV DHTELEDARW
FSRDEAMLMH HRRHPDGLTG AHSFAIAHHL LGRWLHGPSS AT