Gene RPC_1044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_1044 
Symbol 
ID3969654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp1145460 
End bp1146467 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content65% 
IMG OID637924155 
Productalcohol dehydrogenase GroES-like protein 
Protein accessionYP_530927 
Protein GI90422557 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATCG TTTCACAGGT CAAATTCCAA GACATCGGTG TGGTGGTATG CGAGCAGGTC 
GACGTGACCT TGCCCGCGCT GGCGGCCGAT GAGGTCCGTA TCGAGCCGGC CTTTATCGGC
GTCTGCGGCT CCGAGCTGCA CGTGCTGGCG GGCGGGCATC CGTTCGCCAA ACCGCCGATG
GTGACCGGCC ACGAGATTTC CGCCGTCGTA GTGGAAGTCG GCAGCGGCGT CACCAACGCA
CGGGTCGGCG ATCACGTCGT AGTCGATCCG ATCATGGCGT GCGGCAAATG CCGTGCCTGT
CTCTCGGGTC GCTTCAATCT CTGTGAGCCT CCGCAGGTCG CGGGCTTCCG TGCGCCCGGC
CTAGGGCGCT CCTCGCATGT CGTTCCGGCG CGCAACCTGC ACCTTGCGCC AAAGGGCTTG
GCGATGGAGA TCTTGGCGTT CGCCGAGCCG GTGGCCTGCG CGCATCACTG CGTCAGTCGG
ATGCCGGTCG ATGCGCGCGA GGACGTGCTG GTGATCGGCG CCGGCACCAT CGGGCTGTCG
ATTGTGCAGG CGTTGCGCAT CATGGGCGCG CAGAAAATCA CCGTGGTCGA ACCCGAAGCG
CGCAAGCGGG AGTTGGCCTT GCAGATGGGC GCGCACCGCG TCGTCGCGCC GGGCGAATTG
GCCGCGGAGG AACGGTTTAC GGGTGTCATC GACGTGGTGG CGGCGCAGGC GACGCTCACC
GAAGCCTGCA CCAAGGTGAT GGCCGGCGGC ACCGTCATCT GCATGGGCGT GCCCTCCGGC
CCGCGCGAAA TTCCGCTGCC GAGCATGCAG CGCTTTGAAC GCGACCTCTT GAGCTCCGGC
ATGTACGTGC CGTCGGATTT CGACGTTGCG ATCGCCTGGC TGGCGGATGG CAGCTTCGAT
ACTTCGAAGC TGATCACCGA TGTGTTCCCG ATCGAACAGG CCGCCGATGC CTATGCCCGG
GCCAAGCAGC CGGAATCCAT CAAGGTGCTT ATCAAGTTCA AAAACTGA
 
Protein sequence
MTIVSQVKFQ DIGVVVCEQV DVTLPALAAD EVRIEPAFIG VCGSELHVLA GGHPFAKPPM 
VTGHEISAVV VEVGSGVTNA RVGDHVVVDP IMACGKCRAC LSGRFNLCEP PQVAGFRAPG
LGRSSHVVPA RNLHLAPKGL AMEILAFAEP VACAHHCVSR MPVDAREDVL VIGAGTIGLS
IVQALRIMGA QKITVVEPEA RKRELALQMG AHRVVAPGEL AAEERFTGVI DVVAAQATLT
EACTKVMAGG TVICMGVPSG PREIPLPSMQ RFERDLLSSG MYVPSDFDVA IAWLADGSFD
TSKLITDVFP IEQAADAYAR AKQPESIKVL IKFKN