Gene Information |
Locus tag | Rsph17029_2191 |
Symbol | |
ID | 4895103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 2321754 |
End bp | 2323235 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640112785 |
Product | nitrogenase molybdenum-iron protein alpha chain |
Protein accession | YP_001044066 |
Protein GI | 126462952 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01282] nitrogenase molybdenum-iron protein alpha chain; [TIGR01862] nitrogenase component I, alpha chain |
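The Gene Length and Protein Length fields can be cross-checked against Start bp and End bp. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2321754
END_BP = 2323235


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the 1482 bp and 493 aa listed in the table.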
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.395448 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.437472 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAAAG ATATCGCTGA CTCTGCCGAG ACCAACATGA AGCTGATCGA GGAGGTGCTG GCCGCCTACC CCGACAAGGC CAGGAAGAAG CGCGCCAAGC ACCTGAATGT CGCAGCGCCC GTCGCCGAGG CCGAACCCGG CCTCCAGTCG AGATGCGACA ATGTGAAATC GAACATCAAG TCGGTCCCCG GCGTGATGAC CATCCGCGGC TGCGCCTATG CCGGCTCGAA GGGCGTGGTC TGGGGCCCGG TCAAGGACAT GCTGCACATC AGCCACGGCC CGGTCGGCTG CGGCCACTAC AGCTGGTCCC AGCGCCGCAA CTACTACACC GGCACGACGG GCGTGGATTC GTTCGTGACG ATGCAGGTCA CCACCGACTT CCAGGAAAAC GACATCGTCT TCGGCGGTGA CAAGAAGCTG GAAAAGACCA TCGACGAGCT GAACATGCTC TTCCCGCTGA ACAAGGGGAT CTCGATCCAG TCGGAATGCC CGATCGGCCT GATCGGCGAC GACATCGAGG CGGTGTCGAA GAAGAAGGCC AAGGACATCG GCAAGCGCGT CGTTCCGGTG CGCTGCGAGG GATTCCGCGG CGTGTCGCAG TCGCTCGGCC ACCATATCGC GAACGACATG ATCCGCGACT GGGTGCTGGA AGCGGGCGAG GGCGCGCGCG CGGGCTACGA GCCCGGCCCC TATGACGTGA ACATCATCGG CGACTACAAC ATCGGCGGCG ACGCCTGGTC GAGCCGGATC CTGCTGGAAG AGATCGGCCT CAACGTCATC GCGCAGTGGT CGGGCGACGC GACCATCGCC GAGATGGAGC GCGCTCCGGC GGCCAAGCTG AACCTCATCC ACTGCTACCG CTCGATGAGC TACATCTGCC GGCACATGGA AGAGAACCAC GGCGTGCCGT GGATGGAATA CAACTTCTTC GGCCCCTCTC AGATCGCGGC CTCGCTGCGC GCCATCGCCG CGAAGTTCGA CGACAGGATC CAGGCCAATG CCGAAGCGGT CATCGCGAAA TACCAGCCGC TCGTCGACGC GGTGAACGCG AAATACAAGC CGCGCCTCGA AGGCAAGAAG GTGATGCTCT ATGTGGGCGG CCTGCGTCCG CGCCACGTCG TCGACGCCTA CCATGACCTG GGCATGGAGA TCGTGGGCAC CGGCTACGAA TTCGCCCACA ATGACGACTA CAAGCGCACC GGCCATTACA TCAAGGAAGG CACGCTGATC TTCGACGACG TCTCGGGCTA CGAGCTGGAG AAATTCGTCG AGGCGATCCG TCCCGATCTC GTGGGCTCGG GCATCAAGGA GAAATACAAC ACGCAGAAGA TGGGCATCCC GTTCCGTCAG ATGCACTCCT GGGATTATTC CGGCCCCTAC CACGGCTACG ACGGCTACGC GATCTTCGCG CGCGACATGG ATCTCGCGAT CAACAACCCC GTCTGGGGCA TGTTCGACGC GCCCTGGAAG AAGACGGCCT GA
Protein sequence | MAKDIADSAE TNMKLIEEVL AAYPDKARKK RAKHLNVAAP VAEAEPGLQS RCDNVKSNIK SVPGVMTIRG CAYAGSKGVV WGPVKDMLHI SHGPVGCGHY SWSQRRNYYT GTTGVDSFVT MQVTTDFQEN DIVFGGDKKL EKTIDELNML FPLNKGISIQ SECPIGLIGD DIEAVSKKKA KDIGKRVVPV RCEGFRGVSQ SLGHHIANDM IRDWVLEAGE GARAGYEPGP YDVNIIGDYN IGGDAWSSRI LLEEIGLNVI AQWSGDATIA EMERAPAAKL NLIHCYRSMS YICRHMEENH GVPWMEYNFF GPSQIAASLR AIAAKFDDRI QANAEAVIAK YQPLVDAVNA KYKPRLEGKK VMLYVGGLRP RHVVDAYHDL GMEIVGTGYE FAHNDDYKRT GHYIKEGTLI FDDVSGYELE KFVEAIRPDL VGSGIKEKYN TQKMGIPFRQ MHSWDYSGPY HGYDGYAIFA RDMDLAINNP VWGMFDAPWK KTA
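The sequences in this record are displayed in space-separated groups of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation, run on a short excerpt only; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any tool referenced by this record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that splits the displayed sequence into groups."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base groups of the Gene sequence field (an excerpt, not the full gene).
excerpt = clean_sequence("ATGGCGAAAG ATATCGCTGA CTCTGCCGAG")
excerpt_gc = gc_fraction(excerpt)
starts_with_atg = excerpt.startswith("ATG")
```

Passing the full Gene sequence field through the same two functions should give a figure comparable to the GC content row in the Gene Information table, although that result has not been checked here.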