Gene Information |
Locus tag | Rpal_2608 |
Symbol | |
ID | 6410270 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 2820372 |
End bp | 2821751 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642712486 |
Product | Nitrogenase |
Protein accession | YP_001991596 |
Protein GI | 192290991 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | |
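The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon excluded from the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2820372
end_bp = 2821751

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints: 1380 0 459
```

Under these assumptions the result agrees with the listed gene length of 1380 bp and protein length of 459 aa.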
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00388329 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGTCTC CCGTCACCAA CACCCAGGAC CCGACCGGTT CTTTCTGCAG TTCGCCGGGC GCGATCGAGC AGGTGCGCTA CGGCTGCAGC CTCGGCGCGC TCGCCAGCGT GATCGCCATT CCCGGCGCGA TCCCGATCAC GCATTGCGGG CCGGGCTGCG CCACCAAGCA GTTTCACGCG CTGTCCGGCA TCAACGCCTA TCAGGGCGGC GAATTCCACG TGCCGAGTAC CAATCTCGGC AATCAGGAAG TGATCTTCGG CGGCGCCGAC CGGCTCGACG AGCTGATCGC TTCCACGCTC AAGGTGATGG AAGCCGATCT GTTCGTGGTG CAGACCGGCT GCATCCCAGG TCTTGTCGGC GACGACGTCG GCAGCGTGGT CCGCAAGTAT CAGAAACGCG GCGTGCCGAT CGTGGTCGCC GAGACCAGCG GCTATCGCGG CAACAATTTC ACCGGCCATG AGACGGTGAT CCGCGCCATC ATCGATCAGT TCGTCGACGC CGGCGATGCG CCGCTGCAGC ATGGCCTGGT CAATGTCTGG TCGCTGCTGC CGTATCAGAA TCCGTTCTGG CGTGGCGATC TGTCCGAGAT CCGCCGCATC CTCGAAGGCA TCGGCCTTCA GGTGAATATT CTGTTCGGCC CGGCTTCGGC CGGCGTCGCG GAATGGCGCG CGATCCCGCG GGCGCAGTTC AATCTGGTGC TGTCGCCCTG GCTCGGGCTG TCGACCGCGC AGCATCTCGA AGAGCTTTAC GGCCAGCCGT TCCTGCACGA GCCGACGATC CCGATCGGCG CCAAGGCGAC CAGCGCATTT CTTCGGCGCG TCGTGGAGTT CGCCGGGCTC GACCGCGCCC GCGCCGAAGC GTTCATCGCA CAGGAGGAGA AAGAGCATTA CGTCTATTTG CGCGACTTCG CCTGCTTCTA TGCCGGCTCG ACCAGCCAGT ACCGCCTGCC GTCGCAAGCG GTCGTGATCA GTGAGAGCGC CTATAGTCTC GCGGTCGCGA GCTTCCTGGT CGAGCAACTC GGCCTCAATC CGGGACCGTT TGTGATCAGC GAGAATCCGC CGGAGGAACT GCGCGAGACG ATCCAGAAGC AGTATCGCGC GCTCGCCGAG CATTCCGGCG CCGAAGCCGT GTTCGAGCAG GACGGCAAGC GCATCCACGA CATCGTACGC GCCACTGATT TCACCGGCGA GCTCCCGATC GTGTTCGGCT CGACCTGGGA GGCGGCGCTG GCCGACGAGA TCGGGGCGCC GCTGGTCGAG ATCGGCTATC CGTGCACTGA CGAGGTGGTG TTGTCGCGCG CCTATGTCGG CTATCGCGGC GCGTTGCAAT TGATCGAGCG AACCTACACC ACGGTGGTGC GCGCCAGCAC CATGGCATGA
|
Protein sequence | MLSPVTNTQD PTGSFCSSPG AIEQVRYGCS LGALASVIAI PGAIPITHCG PGCATKQFHA LSGINAYQGG EFHVPSTNLG NQEVIFGGAD RLDELIASTL KVMEADLFVV QTGCIPGLVG DDVGSVVRKY QKRGVPIVVA ETSGYRGNNF TGHETVIRAI IDQFVDAGDA PLQHGLVNVW SLLPYQNPFW RGDLSEIRRI LEGIGLQVNI LFGPASAGVA EWRAIPRAQF NLVLSPWLGL STAQHLEELY GQPFLHEPTI PIGAKATSAF LRRVVEFAGL DRARAEAFIA QEEKEHYVYL RDFACFYAGS TSQYRLPSQA VVISESAYSL AVASFLVEQL GLNPGPFVIS ENPPEELRET IQKQYRALAE HSGAEAVFEQ DGKRIHDIVR ATDFTGELPI VFGSTWEAAL ADEIGAPLVE IGYPCTDEVV LSRAYVGYRG ALQLIERTYT TVVRASTMA
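The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It assumes the space-separated blocks of ten letters are display formatting only, and it builds the codon table from the standard amino acid assignments, which translation table 11 shares for internal codons. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 and last 9 bases of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon table in TCAG order; table 11 uses the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 9 bases of the gene sequence above.
prefix = "ATGCTGTCTC CCGTCACCAA CACCCAGGAC"
tail = "ATGGCATGA"

print(translate(prefix))  # prints: MLSPVTNTQD
print(translate(tail))    # prints: MA
```

The two outputs match the first ten and last two residues of the protein sequence. Calling `gc_fraction` on the full gene sequence should give a value comparable to the GC content field, although the exact rounding used by the source database is not stated.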