Gene Information |
Locus tag | Rsph17029_1180 |
Symbol | |
ID | 4895725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 1224635 |
End bp | 1225930 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640111766 |
Product | NADH dehydrogenase I subunit F |
Protein accession | YP_001043062 |
Protein GI | 126461948 |
COG category | [C] Energy production and conversion |
COG ID | [COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit |
TIGRFAM ID | [TIGR01959] NADH-quinone oxidoreductase, F subunit |
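
The coordinate and length fields above are consistent with each other, and a little arithmetic shows how. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length counts one terminal stop codon that is not translated. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above; the names are illustrative only.
start_bp = 1224635
end_bp = 1225930

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that yields no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1296 431
```

Under these assumptions the results match the reported Gene Length (1296 bp) and Protein Length (431 aa).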
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.258408 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.998026 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAGG ACCAGGACCG GATCTTCACC AACCTCTACG GGATGCAGGA CCGCAGCCTG AAGGGCGCGA TGGCCCGGGG CCAATGGGAC GGGACCGCGG ACCTGCTCGC GCTCGGGCGC GATAAGATCA TCGACATCGT GAAGGCCTCG GGTCTGCGCG GCCGCGGCGG GGCGGGCTTC CCCACCGGCC TCAAATGGTC CTTCATGCCC AAACAGTCCG ACGGGCGCCC GGCCTATCTC GTCATCAATG CCGACGAATC CGAGCCCGCG ACCTGCAAGG ACCGCGAGAT CATGCGCCAC GACCCGCACA CGCTGGTCGA GGGGGCGCTG CTCGCAGGCT TCGCGATGGG GGCGGTTGCG GCCTACATCT ACATCCGCGG CGAATATGTC CGCGAGAAGG AAGCGCTGCA GGCCGCCATC GACGAGGCCT ATGACGCGGG CCTGATCGGC CGGAACGCGG CGAAGTCGGG CTACGATTTC GACGTCTACC TCCACCACGG GGCGGGCGCC TACATCTGCG GCGAAGAGAC TGCGCTGCTC GAGAGCCTCG AGGGCAAGAA GGGGATGCCG CGGATGAAGC CGCCGTTCCC GGCGGGCTCG GGCCTCTACG GTTGCCCCAC GACGGTGAAC AACGTCGAGT CGATCGCCGT CATTCCTGCG ATCCTGCGCC GCGGCGGCGA GTGGTTCGCC TCGTTCGGCC GGCCGAACAA CGCAGGCGTG AAGCTCTTCG CCATGTCGGG ACACGTCAAC ACGCCCTGCG TCATCGAGGA GAGCATGTCG ATCTCGATGA AGGAACTCAT CGAGAAGCAC GGCGGCGGCG TCCGCGGCGG CTGGAAGAAC CTCAAGGCTG TCATTCCCGG CGGCGCCTCC TGCCCGATCA TCCCGGCCGA GCAGTGCGAG GATGCCGTCA TGGACTATGA CGGGATGCGC GAGCTGAAAT CCTCGTTCGG CACGGCCTGC ATGATCGTGA TGGACCAGCA GACCGACGTC ATCAAGGCGG TCTGGCGGCT GGCCAAGTTC TTCAAGCACG AGAGCTGCGG CCAGTGCACG CCCTGCCGCG AGGGCACGGG CTGGATGATG CGCGTGATGG ACCGGCTGGT GCGCGGCGAA GCCGAGGTCG AAGAGATCGA CATGCTGCTG TCGGTGACGA AGCAGGTCGA GGGCCACACG ATCTGCGCGC TCGGCGATGC GGCGGCCTGG CCGATCCAGG GCCTCATCCG GCACTATCGC GAAGAGATCG AGGACCGGAT CAAGGCCAAG AAGACGGGGC GCATGGGCGC CATGGCGGCG GAGTGA
|
Protein sequence | MLKDQDRIFT NLYGMQDRSL KGAMARGQWD GTADLLALGR DKIIDIVKAS GLRGRGGAGF PTGLKWSFMP KQSDGRPAYL VINADESEPA TCKDREIMRH DPHTLVEGAL LAGFAMGAVA AYIYIRGEYV REKEALQAAI DEAYDAGLIG RNAAKSGYDF DVYLHHGAGA YICGEETALL ESLEGKKGMP RMKPPFPAGS GLYGCPTTVN NVESIAVIPA ILRRGGEWFA SFGRPNNAGV KLFAMSGHVN TPCVIEESMS ISMKELIEKH GGGVRGGWKN LKAVIPGGAS CPIIPAEQCE DAVMDYDGMR ELKSSFGTAC MIVMDQQTDV IKAVWRLAKF FKHESCGQCT PCREGTGWMM RVMDRLVRGE AEVEEIDMLL SVTKQVEGHT ICALGDAAAW PIQGLIRHYR EEIEDRIKAK KTGRMGAMAA E
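
The sequences above are printed in space-separated blocks of 10 characters, so they need cleaning before any downstream use. The following is a minimal sketch of how the blocks could be joined, the GC fraction computed, and the start of the gene translated. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical and do not come from the database. The codon table used here is the standard one, on the assumption that translation table 11 assigns the same amino acids to codons and differs only in the start codons it permits; since this gene starts with ATG, that difference should not matter here.

```python
from itertools import product

# Standard codon table in TCAG order (assumed to match table 11 for
# amino-acid assignments; table 11 differs in allowed start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a cleaned sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence from the record above.
prefix = clean_sequence("ATGCTGAAGG ACCAGGACCG")
print(translate(prefix))  # prints: MLKDQD
```

The translated prefix agrees with the first six residues of the protein sequence in the record. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the reported GC content.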