Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_1247 |
Symbol | |
ID | 5084420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 1288119 |
End bp | 1289600 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640482805 |
Product | nitrogenase molybdenum-iron protein alpha chain |
Protein accession | YP_001167453 |
Protein GI | 146277294 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01282] nitrogenase molybdenum-iron protein alpha chain [TIGR01862] nitrogenase component I, alpha chain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.867808 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAAAG ACATTGCAGA TTCTGCGGAA GCCAACATGA AGCTGATCGA GGAGGTGCTT GCCGCCTATC CCGACAAGGC CAAGAAGAAA CGCGCCAAGC ACCTTGGCGT CGCCGAGACG ATTGCCGATG CCGAGCCCGG CATCCAGTCG AAATGCGACA CCGTCAAGTC GAACATCAAG TCGGTTCCCG GCGTGATGAC CATCCGCGGC TGCGCCTACG CCGGCTCGAA GGGCGTGGTC TGGGGTCCGG TCAAGGACAT GCTGCACATC AGCCACGGCC CGGTGGGCTG CGGCCACTAC AGCTGGTCCC AGCGCCGCAA CTACTACACC GGCACCACGG GCATCGACAG CTTCGTGACC ATGCAGGTCA CCACCGACTT CCAGGAAAAC GACATCGTCT TCGGCGGTGA CAAGAAGCTG GAAAAGACCA TCGACGAGCT GAACACGCTC TTCCCGCTGA ACAAGGGCAT CTCGATCCAG TCCGAATGCC CGATCGGCCT GATCGGCGAC GACATCGAGG CGGTGTCGAA GAAGAAGGCC AAGGACATCG GCAAGCGCGT GATCCCGGTG CGCTGCGAGG GCTTCCGCGG CGTGTCGCAG TCGCTCGGCC ACCATATCGC GAACGACATG ATCCGCGACT GGGTGCTCGA GGCGGGCGAG GGCGCGCGGG CGGGTTTCGA GGCCGGCCCC TATGATGTCA ACATCATCGG CGACTACAAC ATCGGCGGCG ACGCCTGGTC GAGCCGGATC CTCCTGGAAG AGATCGGCCT GAACGTGATC GCCCAATGGT CGGGCGACGC CACCATCGCC GAGATGGAGC GGGCGCCGGC GGCCAAGCTG AACCTGATCC ACTGCTACCG CTCCATGAGC TACATCTGCC GGCACATGGA AGAGAAGCAC GGCGTTCCGT GGATGGAATA CAACTTCTTC GGGCCGAGCC AGATCGCCGC GTCCCTGCGC GCCATCGCGG CGAAGTTCGA CGCCACCATC CAGGCCAATG CCGAGGCGGT GATCGCGAAA TATCAGCCGC TCGTCGATGC GGTGAACGCG AAATACAAGC CGCGCCTCGA AGGCAAGAAG GTGATGCTCT ACGTCGGCGG CCTGCGTCCG CGCCACGTCG TCGATGCCTA CCACGACCTC GGCATGGAGA TCGTCGGCAC CGGCTACGAG TTCGCCCACA ACGACGACTA CAAGCGCACC GGCCACTACA TCAAGGAAGG CACGCTGATC TACGACGACG TGTCGGGCTA CGAGCTGGAG AAGTTCGTCG AGGCGATCCG TCCCGATCTC GTCGGCTCGG GCATCAAGGA GAAGTACAAC ACGCAGAAGA TGGGCATCCC GTTCCGTCAG ATGCACTCGT GGGACTACTC CGGCCCCTAC CACGGCTACG ACGGCTACGC GATCTTCGCG CGCGACATGG ATCTTGCGAT CAACAACCCG GTCTGGGGCA TGTTCGACGC GCCCTGGAAG AAGACGGCCT GA
|
Protein sequence | MAKDIADSAE ANMKLIEEVL AAYPDKAKKK RAKHLGVAET IADAEPGIQS KCDTVKSNIK SVPGVMTIRG CAYAGSKGVV WGPVKDMLHI SHGPVGCGHY SWSQRRNYYT GTTGIDSFVT MQVTTDFQEN DIVFGGDKKL EKTIDELNTL FPLNKGISIQ SECPIGLIGD DIEAVSKKKA KDIGKRVIPV RCEGFRGVSQ SLGHHIANDM IRDWVLEAGE GARAGFEAGP YDVNIIGDYN IGGDAWSSRI LLEEIGLNVI AQWSGDATIA EMERAPAAKL NLIHCYRSMS YICRHMEEKH GVPWMEYNFF GPSQIAASLR AIAAKFDATI QANAEAVIAK YQPLVDAVNA KYKPRLEGKK VMLYVGGLRP RHVVDAYHDL GMEIVGTGYE FAHNDDYKRT GHYIKEGTLI YDDVSGYELE KFVEAIRPDL VGSGIKEKYN TQKMGIPFRQ MHSWDYSGPY HGYDGYAIFA RDMDLAINNP VWGMFDAPWK KTA
|
| |