Gene RPD_1160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1160 
Symbol 
ID4021636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1319752 
End bp1320753 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content70% 
IMG OID637961352 
ProductNADH ubiquinone oxidoreductase, 20 kDa subunit 
Protein accessionYP_568299 
Protein GI91975640 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA) 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCCGT TCAATGTGCT GTGGCTGCAG GGCGCGAGCT GCGGCGGCTG CACCATGGCG 
GCGCTCGACA ACGGCCACTC CGGCTGGTTC GCCGACCTCG CCCGTTTCGG CATCGATCTG
ATCTGGCATC CCTCGCTGAG CGAGGCGACC GCAGACGAAG CGGTCGCGAT CTTCGAGCGC
GTGGCGGACG GCGCGCAGCG GCTCGACGCG CTGGTGCTGG AAGGCGCGGT GCTGCGCGGG
CCGAACGGCA CCGGCCGCTT CAACATGCTC GGCGGCACCC ATCGCGCGAT GCTGCATTGG
GTCCGCGCGC TCGCGCCCTG CGCGAACTAC GTCGTCGCCG CCGGGAGTTG CGCTGCATTC
GGCGGCGTGC CGATGGCCGG CAGCAATCCG ACCGACGCCA GCGGCCTGCA GTTTGCGGGC
GTGGAAGCAG GCGGCGCGCT CGGCGCGGCG TTTCGATCCC GCGCCGGCCT GCCGGTGATC
AACATCGCCG GCTGCGCGCC GCATCCGGGC TGGATTGCCG AAACACTGGC GGCGCTGGCG
CTCGGCGGTT TCGACAGCGA AGCGCTCGAC AGCTTCGGCC GGCCGCGATT CTACGCCGAT
CACCTCGCGC ATCACGGCTG CGCCCGCAAC GAGTATTACG AGTTCAAGGC CAGCGCCGAG
ACGCTGTCGC AGCAAGGCTG CCTGATGGAG CATCTCGGCT GCAAGGCGAC CCAGGCGGTC
GGCGACTGCA ACCAGCGCGG CTGGAACGGC GGCGGCTCCT GCACCAGCGG CGGCGGCGCC
TGCATCGCGT GTACGTCGCC CGGCTTCGAG GCATCGCAGA ACTTCATGGA GACCGCCAAG
CTCGGCGGCA TTCCGGTCGG CCTGCCGCTC GACATGCCGA AGGCGTGGTT CGTGGCGCTG
GCCGCGTTGT CGAAATCGGC GACGCCGAAA CGGGTGCGCG CCAATGCGAC CGCCGACCAT
GTGATCGTGC CGCCGCGCAC TGATCACGGA CGCCGCAAAT GA
 
Protein sequence
MEPFNVLWLQ GASCGGCTMA ALDNGHSGWF ADLARFGIDL IWHPSLSEAT ADEAVAIFER 
VADGAQRLDA LVLEGAVLRG PNGTGRFNML GGTHRAMLHW VRALAPCANY VVAAGSCAAF
GGVPMAGSNP TDASGLQFAG VEAGGALGAA FRSRAGLPVI NIAGCAPHPG WIAETLAALA
LGGFDSEALD SFGRPRFYAD HLAHHGCARN EYYEFKASAE TLSQQGCLME HLGCKATQAV
GDCNQRGWNG GGSCTSGGGA CIACTSPGFE ASQNFMETAK LGGIPVGLPL DMPKAWFVAL
AALSKSATPK RVRANATADH VIVPPRTDHG RRK