Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_2672 |
Symbol | |
ID | 5209641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 3318633 |
End bp | 3319709 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640596274 |
Product | NADH-ubiquinone oxidoreductase, chain 49kDa |
Protein accession | YP_001276996 |
Protein GI | 148656791 |
COG category | [C] Energy production and conversion |
COG ID | [COG3261] Ni,Fe-hydrogenase III large subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCATACT CGCTGGCGCT GGGACCTTTC CACCCTGCAT GGCTCGGTCC GCAGCGCTTC ATTCTGCACA TTGCCGACGA TCGCGTCGTC GATGTCGAGT ATCAGAACGG GTTCAACGAA CGCGGTTGCG CCGAGCGATT ACCACGATTG CCGCTGCCCG ATGCGCTGCA CCTGGTTGCG CGGATCTGCG GTGAGTGTTC CTTTGCCCAT TCCCTGGCGT TCTGTCAGGC GATCGAGCAG GTACAACGGC GAAAAGTGGG CGCGCGCGCC GCGCTGCTCC GGGTTGCAAT CGCCGAGTTG GAACGGACGG CGGCCCATCT CTATACAGCG CGCGCCGTGC TGGACGCAAT CGGCATGGAG CAGCGCGGCG CAATCCTCGA TCATTTTTGC CAGCAGTCCC GTGATATGCT GGCGCTGGTC ACCGGCGGTC GCATTCCGCC GCCTGTGTGT GCACCGGGTG GTCTGTTGCG CGACCTGACG CCGGTTGAGC GGCAAGAGGT ACTGGCGATG TTGCCGGGAA TGAGCGCCGA CCTCTATCGC TTCATTGATC GCCTCATCGA TCAGCGCTTC TTGCTTATGC GCACCGTCGA GGTTGGGGTG TTGCCGCGCG CGGCAGCCGA GCAGTTTGGC GTGCGCGGTC CACTGGCGCG CGCTTCCGGC ATCCGTCGCG ATGTCCGAGT CGATCAGCCC TACGCAGCAT ACAGCATGCT CGATGTTCAA CCGATCATTC AGGAAGGCGG CGATGTGTAC GCGCGCCTGC TGTTACTCTT GCTGGAAGCG TATGAGGGTG TCAAACTCGT CGAATCTGCA CTGCAACGCC TCCCTGAAGA AGACCCGATC GTCGAACTGC CGCACGAATT GCCGCGTGGT CAGGCCTCCT CGGTTGTTGA GGGTCCTCGA GGCGCCATTC GTTATACACT CGAGAGCGAT GGCGTGCGAT TGACCCGGGT GCAGATTGAT ACGCCGCGTC AGTGCGATCG ACTGCTGGCG CGCACCCTGC TGAGCCGGGC GCAACTGGAT GATGTGATGG CAATTCTGGC GTCGCTCGCA GTGTGTGTCG CCTGCGCCGA GCAATAG
|
Protein sequence | MSYSLALGPF HPAWLGPQRF ILHIADDRVV DVEYQNGFNE RGCAERLPRL PLPDALHLVA RICGECSFAH SLAFCQAIEQ VQRRKVGARA ALLRVAIAEL ERTAAHLYTA RAVLDAIGME QRGAILDHFC QQSRDMLALV TGGRIPPPVC APGGLLRDLT PVERQEVLAM LPGMSADLYR FIDRLIDQRF LLMRTVEVGV LPRAAAEQFG VRGPLARASG IRRDVRVDQP YAAYSMLDVQ PIIQEGGDVY ARLLLLLLEA YEGVKLVESA LQRLPEEDPI VELPHELPRG QASSVVEGPR GAIRYTLESD GVRLTRVQID TPRQCDRLLA RTLLSRAQLD DVMAILASLA VCVACAEQ
|
| |