Gene RPD_3644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3644 
Symbol 
ID4024158 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4066954 
End bp4068093 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content62% 
IMG OID637963848 
Producthypothetical protein 
Protein accessionYP_570768 
Protein GI91978109 
COG category[S] Function unknown 
COG ID[COG4645] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.82541 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.662915 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATCA TGAACGACGC AACGAAATCA GGCGGCACGG GACGGGATCT GCGTCTTGAT 
CTGTTCCGGG GCATGGCGAA CTGGGCGATC TTCCTGGATC ACGTGCCCAA CAACGTGGTC
GCGTGGCTCA CCATGCGGAA CTACGGGTTT AGCGACGCGG CGGAGCTGTT CGTTTTCGTA
TCGGGGTTCA CGGTCGCGTT CGTGTACTCG AAGACGCTGC ATGCGAAGGG TATTCTTGCA
GCGACCGCCG GGATCCTCGG CCGGGTCTGG CAGATCTACG TCGCCTACGT GCTCCTTTTC
GTCTTCTACG TGGTGGCGGT CGGCTACGTC GCTCAGCGCT ACGGTCACGC CCATCTGCTC
GACGAATACA ACATCCGCAG CCTCATCGCC GATCCGGTCG AGTTTCTGAA ACATGGGTTG
CTGCTAGAGT ATCGTCCCCT CAACCTCGAC GTGCTGCCGC TGTACATCGC CCTGATGGCC
CCGTTTCCCT TGGTGCTCTG GTCGTTGACC AAGGCCCCAG GCGTCACGTT GGCGGGCTCG
ATCGCCCTCT ATGCGGCGGC CCGGTCGTTC GGCTGGAACC TTCCGGGTTA CCCGGCAGGA
TATTGGTATT TCAATCCGTT CGCCTGGCAG CTCCTTTTTG TGATCGGTGC GTGGACGGCC
ACCGTCGATC GCGGCACGTT GGATCGGACG TTGCGCTCGG GCATCGTGCT GCCGCTCGCC
ATCGCTGTCG TCGCTATTTC GGCGATCGTT ATGCTTGCGC CACTGGCGGG AAATGCCTGG
TTGCTGCCGG AGATGCTGCG CCTTCCCTTT CCGATGGCCG ACAAGACGAA CCTTGCCCCT
TACCGCATCG CCCACTTCCT AGCCTTGGCG ATCATCGTGG CGCGCCTCGT TCCGAGAAAC
GCGCCGGCGC TGGCCTGGCC GGTCTGGCGG CCGTTGATCG TCAGCGGGCA GCATTCGCTC
GAAGTTTTCT GCGCGGGAAC GTTTTTCGCG GCCATCGCCT ATTTCACCCT CGATCTCGTC
GATGGATCCG TCAGATCCCA GCTCGTCGTG AGCGCGGCGG GCATCTGCGC GATGGTCGCC
GTGGCCTACT TCCGAAAGTG GTCGAAAGAG AACAAATTGC GCCCTGCGAT CGCGAGATGA
 
Protein sequence
MDIMNDATKS GGTGRDLRLD LFRGMANWAI FLDHVPNNVV AWLTMRNYGF SDAAELFVFV 
SGFTVAFVYS KTLHAKGILA ATAGILGRVW QIYVAYVLLF VFYVVAVGYV AQRYGHAHLL
DEYNIRSLIA DPVEFLKHGL LLEYRPLNLD VLPLYIALMA PFPLVLWSLT KAPGVTLAGS
IALYAAARSF GWNLPGYPAG YWYFNPFAWQ LLFVIGAWTA TVDRGTLDRT LRSGIVLPLA
IAVVAISAIV MLAPLAGNAW LLPEMLRLPF PMADKTNLAP YRIAHFLALA IIVARLVPRN
APALAWPVWR PLIVSGQHSL EVFCAGTFFA AIAYFTLDLV DGSVRSQLVV SAAGICAMVA
VAYFRKWSKE NKLRPAIAR