Gene Rru_A1161 details


Gene Information

Locus tag: Rru_A1161
Symbol:
ID: 3834671
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 1378441
End bp: 1379526
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 65%
IMG OID: 637825250
Product: Ni-Fe hydrogenase, small subunit
Protein accession: YP_426249
Protein GI: 83592497
COG category: [C] Energy production and conversion
COG ID: [COG1740] Ni,Fe-hydrogenase I small subunit
TIGRFAM ID: [TIGR00391] hydrogenase (NiFe) small subunit (hydA)
            [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
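
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, although it matches the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1378441
end_bp = 1379526

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases each.
codons = gene_length // 3

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, "bp,", protein_length, "aa")
```

Under these assumptions the script reproduces the 1086 bp gene length and the 361 aa protein length given in the table.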


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGGAAA CCGAGACCTT TTACGAGGTC ATCCGTCGCC AGGGGATTTC CCGGCGCGGC 
TTCTTGAAGT TCTGCGGTGT CACCGCCGCC GGGCTGGGCC TGGGCGCCGG CGGCGCGGCG
CGCATCGCCC AGGCGCTGGA AACCAAGCCA CGGGTGCCGG TGATCTGGCT GCATGGCCTG
GAATGCACCT GTTGTTCGGA AAGCTTCATC CGCTCGGCCC ATCCGCTGGT CAGCGACGTG
GTGCTGTCGA TGCTGTCGCT CGATTACGAC GACACGCTGA TGGCCGCCGC CGGTCATCAG
GCCGAGGCGA TCCTGGCCGA GACCCGCGAG ACCTATCGCG GGCGCTATAT CCTGGCGGTC
GAGGGCAACG CGCCGCTGGC CAATGACGGC TTTTTCTGTA TGCCCGGCGG TCGGCCCTTC
GTTGATACCC TGAAGGAAAT GGCCGCCGAC AGCGCCGCCG TCATCGCCTG GGGATCGTGC
GCCAGTTGGG GCTGCGTTCA GGCGGCCAAG CCCAATCCCA CCGGGGCGGT GCCGATTGAT
CAGGTGATCA CCGGCAAGCC GCTGATCAAG GTGCCGGGCT GTCCGCCGAT CGCCGAGGTG
ATGACCGGGG TGATCAGCTA CCTGCTGACC TTCGACCGCT TCCCCGAGCT TGATCTGCAG
GGGCGGCCGA AAATGTTCTA TTCCCAACGC ATCCACGACA AATGTTACCG CCGCGGCCAT
TTCGATGCCG GCCAGTTCGT CGAGGCCTTC GACGATGACG CCGCCCGCCG CGGTCACTGT
CTGTACAAGA TGGGCTGCAA GGGGCCCACG ACCTACAACG CCTGTTCGAC CACCGGCTGG
AACGAGGGCA CCTCGTTTCC CATCCAATCG GGCCATGGCT GCCTGGGCTG TTCAGAGGAT
GGCTTTTGGG ACAAGGGGCC GTTCTACGAG CGGTTGTCGA CCATCAATCA GTTCGGGATT
GAAGCCAATG CCGACATCGT AGGCGGAACG GCCGCCGGGG TGGTGGCGGC CGGGGTGGCG
GCCCATGCCG GCGTCACCGT GGCCCGGCGC CTGATGTCGA AGAACGAAAA CAAAGACAAA
GAGTAG
 
Protein sequence
MGETETFYEV IRRQGISRRG FLKFCGVTAA GLGLGAGGAA RIAQALETKP RVPVIWLHGL 
ECTCCSESFI RSAHPLVSDV VLSMLSLDYD DTLMAAAGHQ AEAILAETRE TYRGRYILAV
EGNAPLANDG FFCMPGGRPF VDTLKEMAAD SAAVIAWGSC ASWGCVQAAK PNPTGAVPID
QVITGKPLIK VPGCPPIAEV MTGVISYLLT FDRFPELDLQ GRPKMFYSQR IHDKCYRRGH
FDAGQFVEAF DDDAARRGHC LYKMGCKGPT TYNACSTTGW NEGTSFPIQS GHGCLGCSED
GFWDKGPFYE RLSTINQFGI EANADIVGGT AAGVVAAGVA AHAGVTVARR LMSKNENKDK
E
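
The relationship between the gene sequence and the protein sequence can be illustrated by translating a fragment of the gene. The sketch below is an illustration, not the method used to produce this record. It assumes that the sense-codon assignments of translation table 11 are the same as those of the standard genetic code, so a standard codon table is used. The names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 and last 27 bases of the gene sequence are translated.

```python
from itertools import product

# Standard codon table in the conventional TCAG ordering.
# Assumption: translation table 11 uses these same codon assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence above.
first = translate("ATGGGGGAAA CCGAGACCTT TTACGAGGTC")
last = translate("AAGAACGAAA ACAAAGACAA AGAGTAG")

print(first)  # MGETETFYEV
print(last)   # KNENKDKE
```

The two results match the first ten and last eight residues of the protein sequence. Translating the full 1086 bp sequence the same way would be the natural next check.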