Gene Information |
Locus tag | RoseRS_1199 |
Symbol | |
ID | 5208151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 1469478 |
End bp | 1470908 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640594817 |
Product | nitrogenase component I, alpha chain |
Protein accession | YP_001275556 |
Protein GI | 148655351 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01282] nitrogenase molybdenum-iron protein alpha chain; [TIGR01284] nitrogenase alpha chain; [TIGR01862] nitrogenase component I, alpha chain |
|
|
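The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the record; it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 1469478
end_bp = 1470908

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is assumed
# to be the stop codon, which is not counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1431 0 476
```

Under those assumptions the result agrees with the listed Gene Length (1431 bp) and Protein Length (476 aa).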
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGTTCA AATGCAATCA GACCCTGCCT GAGCGAGCGA TCCATATCGC GCTCAAGGGA CCGGGCGGGA AGTGTCAGCG CGGCGATGGC ACCACCTGTT TCATTGCCAA CAACGTGGCA ACGACGCCTG GCGATATGAC CGAGCGCGGC TGCACCTACG CCGGCTGTCG CGGCGTCGTC GGCGGACCGG TCAAGGACGC TATTCAACTG ACCCACGGAC CGATCGGGTG CGCGTTCTTC TCCTGGGGCT ACCGTCCGCA CCTCGCCGAC AGCGATTTTC ACATGAAGTA CACCTTCGTC ACCGATATGA ACGAAACCAA CATCGTCTTC GGCGGCGAGA AAAAGCTGCT TCAGTCGATC ATCGAAGCCA ATGCCGAGTT TCCCAATGCG AAGGCGGTGT TCGTCTACAA CACCTGCTCT ACGGCGCTGA TCGGCGATGA CGGGCGCGAC GTCGCCAAAC AGGCGGAAGC GATCATCGGC AAACCGGTGG TGTTCTTCGA GTGCGAGGGG TTTCGTGGCG TCAGTCAGTC GATGGGGCAC CACGTCGGCA ACGAGACGAT CTTTCGTCAA CTGGTCGGCT CGGTCGAACC GGAGGGCGAT TTCAGCCGCT CGATCAACAT CATCGGCGAC TACAACATCA AGAATGACAT CCGCACCTTC GAGTATCTCT TCGAGGCGCT TGGCTTGCGG ATCATCGCCC GCTTCACCGG GAACGTCTCA GTGGACGACC TGAAGATCAT GCACAAAGCG GCGCTCAATA TCGTGCACTG CCAGCGATCC GCCACCTACA TCGCCGACAT GATGAAGGAT AAGTATGGCA CGCCGTACAT CAATGTCACG CTCTGGGGCA TGAAGAACAT GGCAAAAGCG CTGCGCGACA CCGCCGCGTT CTTCGGGCTT GAAGCGCGCG CCGAAGAAGT GATCGCCCGA GAAGTGGCGC GCATTCAACC CTACATCGAC GCCTATCGTC AACGCCTGCA GGGGAAGCGC GTCTTCATCT ACCAGGGCGG TCCGCGCGTC TGGCACTGGA TCGAACTCCT GCGCGAATTG GGCATGGAGA CCGAGACGGC AGCCACAACC TTCGGGCATA CCGATGATTA CGAGAAGATA TTCAACCAGA TCGGCGAAGG CGCGCTGGTT ATCGACAACC CGAATGTCCC CGAAATCGAA GAAATCCTGA CCCGTCGCCG TCCCGACCTG TTCATCTCAG GCAACAAGGA GCGATACCTG GCATACAAAA TGGGCGTGCC GTTCGTCAAT GGGCATACCT ACGACACCGG ACCCTACGCC GGTTTTGTGG GCATGGTCAA CTTCGCGCGC GATATCGATA AAGCCCTGCA TGCGCCGGTC TGGAATATCG TGCATCAGCA CGCCCGACCT GCACCCGTCG CCCGCCACGC AGTCCACGGA TCTGAGGAGG TGGAGTCATG A
|
Protein sequence | MQFKCNQTLP ERAIHIALKG PGGKCQRGDG TTCFIANNVA TTPGDMTERG CTYAGCRGVV GGPVKDAIQL THGPIGCAFF SWGYRPHLAD SDFHMKYTFV TDMNETNIVF GGEKKLLQSI IEANAEFPNA KAVFVYNTCS TALIGDDGRD VAKQAEAIIG KPVVFFECEG FRGVSQSMGH HVGNETIFRQ LVGSVEPEGD FSRSINIIGD YNIKNDIRTF EYLFEALGLR IIARFTGNVS VDDLKIMHKA ALNIVHCQRS ATYIADMMKD KYGTPYINVT LWGMKNMAKA LRDTAAFFGL EARAEEVIAR EVARIQPYID AYRQRLQGKR VFIYQGGPRV WHWIELLREL GMETETAATT FGHTDDYEKI FNQIGEGALV IDNPNVPEIE EILTRRRPDL FISGNKERYL AYKMGVPFVN GHTYDTGPYA GFVGMVNFAR DIDKALHAPV WNIVHQHARP APVARHAVHG SEEVES
|
| |
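The gene and protein sequences in the Sequence section can be related by translating the gene sequence codon by codon. The following is a minimal sketch, not the database's own method: it builds the codon-to-amino-acid map shared by the standard code and translation table 11 (the two differ only in permitted start codons), and the helper name `translate` is hypothetical. Only the first 30 and last 21 bases of the gene sequence are used here for brevity.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional T, C, A, G order.
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

first_30 = "ATGCAGTTCA AATGCAATCA GACCCTGCCT"  # start of the gene sequence
last_21 = "TCTGAGGAGG TGGAGTCATG A"            # end of the gene sequence

print(translate(first_30))  # MQFKCNQTLP
print(translate(last_21))   # SEEVES
```

Both fragments match the corresponding ends of the listed protein sequence. The gene sequence is shown on the coding strand (it begins with ATG), so no reverse complement is needed even though the gene lies on the minus strand.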