Gene Information |
Locus tag | Xaut_0089 |
Symbol | |
ID | 5420792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Xanthobacter autotrophicus Py2 |
Kingdom | Bacteria |
Replicon accession | NC_009720 |
Strand | + |
Start bp | 97375 |
End bp | 98880 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640879334 |
Product | nitrogenase molybdenum-iron protein alpha chain |
Protein accession | YP_001415005 |
Protein GI | 154244047 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01282] nitrogenase molybdenum-iron protein alpha chain; [TIGR01862] nitrogenase component I, alpha chain |
|
|
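The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names `gene_length_bp` and `expected_protein_length_aa` are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Xaut_0089.
length = gene_length_bp(97375, 98880)
protein = expected_protein_length_aa(length)
print(length, protein)  # 1506 501
```

Both results agree with the Gene Length (1506 bp) and Protein Length (501 aa) fields of the record.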
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTGG CCCAACCGCA AAGCGTTGCT GAAATCAAGG CGCGTAACAA AGAACTCATC GCGGAAGTTC TCAAGGTTTA TCCCGAGAAG ACCGCGAAGC GCCGCGCGAA GCACCTCAAC GTCCACGAGT CCGGCAAGTC CGATTGCGGC GTGAAGTCGA ACATCAAGTC CATCCCGGGC GTGATGACGA TCCGCGGCTG CGCGTATGCC GGTTCCAAGG GCGTGGTGTG GGGTCCCATC AAGGACATGA TCCACATCTC CCACGGCCCG GTGGGCTGCG GCCAGTACAG CTGGGCCGCC CGCCGCAACT ACTACATCGG CACGACGGGC ATCGACACCT TCGTGACCAT GCAGTTCACC TCGGACTTCC AGGAAAAGGA CATCGTCTTC GGCGGCGACA AGAAGCTCGC CAAGATCATG GACGAAATCC AGGACCTGTT CCCGCTGAAC AACGGCATCA CCGTCCAGTC CGAGTGCCCG ATCGGCCTCA TCGGCGACGA CATCGAGGCC GTGTCCAAGG CGAAGTCCAA GGAATACGAC AACAAGACCA TCGTGCCGGT CCGCTGCGAG GGCTTCCGCG GCGTGTCCCA GTCGCTCGGC CATCACATCG CCAACGATGC GATCCGGGAC TGGGTGTTCG ACAAGATGGA CCCGAATGCG GCCCCCCGCT TCGAGCCGTC CCCGTATGAC GTCGCCATCA TCGGCGACTA CAACATCGGT GGCGACGCCT GGTCTTCGCG CATCCTGCTC GAGGAAATGG GCCTGCGCGT GATCGCACAG TGGTCCGGCG ACGGCTCGCT TGCCGAGCTC GAGGCGACTC CGAAGGCGAA GCTGAACGTC CTGCACTGCT ACCGGTCCAT GAACTACATC TCCCGCCACA TGGAAGAGAA GTTCGGTATC CCGTGGTGCG AGTACAACTT CTTCGGCCCG TCCAAGATCG CCGAGTCGCT GCGTAAGATC GCCGCTTACT TCGACGACAA GATCAAGGAA GGCGCCGAGC GCGTCATCGC CAAGTACCAG CCCCTCATGG ACGCGGTCAT TGCCAAGTAC CGTCCGCGGC TCGAGGGCAA GACCGTCATG CTGTACGTGG GCGGCCTGCG TCCCCGTCAC GTGATCGGCG CCTACGAAGA CCTCGGCATG GAAGTGGTCG GCACCGGCTA CGAGTTCGGC CACAACGACG ACTATCAGCG CACCGCCCAG CACTACGTCA AGGATGGCAC GATCATCTAT GACGACGTGA CCGGCTATGA GTTCGAGAAG TTCGTCGAGA AGGTCCAGCC CGATCTCGTC GGCTCGGGCA TCAAGGAAAA GTACGTGTTC CAGAAGATGG GTGTGCCGTT CCGGCAGATG CACTCCTGGG ACTATTCCGG CCCGTATCAC GGCTATGACG GCTTCGCCAT CTTCGCCCGC GACATGGACA TGGCCATCAA CTCCCCCGTC TGGAAGATGA CCAAGGCTCC GTGGAAGCAG GCTCCCGCGC AGCCGGGCCT GCTCGCGGCC GAGTGA
|
Protein sequence | MSLAQPQSVA EIKARNKELI AEVLKVYPEK TAKRRAKHLN VHESGKSDCG VKSNIKSIPG VMTIRGCAYA GSKGVVWGPI KDMIHISHGP VGCGQYSWAA RRNYYIGTTG IDTFVTMQFT SDFQEKDIVF GGDKKLAKIM DEIQDLFPLN NGITVQSECP IGLIGDDIEA VSKAKSKEYD NKTIVPVRCE GFRGVSQSLG HHIANDAIRD WVFDKMDPNA APRFEPSPYD VAIIGDYNIG GDAWSSRILL EEMGLRVIAQ WSGDGSLAEL EATPKAKLNV LHCYRSMNYI SRHMEEKFGI PWCEYNFFGP SKIAESLRKI AAYFDDKIKE GAERVIAKYQ PLMDAVIAKY RPRLEGKTVM LYVGGLRPRH VIGAYEDLGM EVVGTGYEFG HNDDYQRTAQ HYVKDGTIIY DDVTGYEFEK FVEKVQPDLV GSGIKEKYVF QKMGVPFRQM HSWDYSGPYH GYDGFAIFAR DMDMAINSPV WKMTKAPWKQ APAQPGLLAA E
|
| |
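The sequence fields are printed in space-separated blocks of ten characters, so any check has to strip the whitespace first. The sketch below is an illustration only: `clean`, `gc_fraction` and `translate` are hypothetical helper names, and the codon lookup assumes that translation table 11 assigns amino acids exactly as the standard genetic code does (the two tables differ in their permitted start codons, which this sketch does not model).

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered with T, C, A, G at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A trailing partial codon is ignored. Only A, C, G and T are supported.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        a, b, c = (BASES.index(base) for base in s[i:i + 3])
        aa = AMINO_ACIDS[16 * a + 4 * b + c]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
print(translate("ATGAGCTTGG CCCAACCGCA AAGCGTTGCT"))  # MSLAQPQSVA
```

Passing the full Gene sequence field to `translate` and comparing the result with the Protein sequence field (after `clean`) is one way to confirm that the two fields are consistent; `gc_fraction` on the same input can be compared with the reported GC content.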