Gene Information |
Locus tag | Rpal_1620 |
Symbol | |
ID | 6409277 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 1733902 |
End bp | 1735287 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 642711509 |
Product | Fe-only nitrogenase, beta subunit |
Protein accession | YP_001990624 |
Protein GI | 192290019 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR02931] Fe-only nitrogenase, beta subunit |
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.855819 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTGTC AATTCAAGCC GAAGGATCGG ATCGGCACCA TCAATCCGAT CTTCACCTGC CAGCCGGCCG GCGCCCAATA CGCCTCGATC GGTATCAAGG ACTGCATCGG CATCGTCCAC GGCGGCCAGG GCTGCGTGAT GTTCGTCCGC CTGCTGATCT CGCAGCACCT CAAAGAGAGC TTCGAGATCG CCTCCTCGTC GGTGCACGAG GACGGCGCAG TGTTCGGCGC ACTCGACCGC GTCGAGGAAG CGGTCGACGT GCTGCTGATG CGCTACCCGC ACGTCAAGGT GGTGCCGATC ATCACCACCT GCTCGACCGA AGTAATCGGC GACGACGTCG ACGGCGTCGT CACCAAGCTG CAGGAAGAGC TGCTCGACGT GAAATATCCG GACCGCGAGG TCCATCTGAT CCCGATCCAC TGCCCGAGCT TCGTCGGCAG TATGGTGACG GGCTACGACG TCGCCGTGCG CGACTTCGTC AAGCATTTCT CCAAGAAGGA CAAACCGAGC CTGAAGATCA ATCTGATCAC CGGCTGGGTC AATCCGGGCG ACGTCAAGGA GCTCAAGCAT CTGCTCGCCG AGATGCAGGT CGAAGCCAAC GTGCTGTTCG AGATCGAGAG CTTCGATAGC CCGCTGATGC CGGACAAGAG CGCGGTGTCG CACGGCTCGA CCACGATTGC GGACCTGACC GCAACCGCCG ATGCGCAAGG CACCATCGCA CTCAACCGCT ACGAAGGTGG CCTCGCCGCC ACCTGGCTGC AGACCCGCTG GGACGTGCCG GCAGTGATCG GCTCAACCCC GATCGGGATC CGCAACACCG ACACCTTCCT GCGCAACGTC AAGGACATGA CAGGCAAGCC GATCCCGGAA TCGCTGGTCA AGGAACGCGG CATCGCGCTC GACGCCATCA CCGACCTCGC CCACATGTTC CTGGCCGACA AGAAGGTGGC GATCTACGGC AATCCGGATC TGGTGCTCGG CCTCGCCGAG TACTGCCTCG ACCTGGAAAT GAAACCGGTG CTGCTGCTGC TCGGCGACGA GAACGCCGGC TACGCCAACG ACCCTCGGAT CGCGGCGCTG AAGGCCAATG TCGACTACCC GATGGAGATC ATCACCAACG CCGACCTGTT CGAGCTCGAA GAGCGCATCA AGAACGGCCT CGAGCTCGAT CTGATCCTCG GCCACTCCAA GGGACGGTTC GTCGCGATCG ACTACAACAT CCCGATGCTA CGGGTCGGCT TTCCGACCTA CGACCGCGCC GGCCTGTATC GCTACCCCGT GGTCGGCTAC GCCGGCGCGA CCTGGCTCGC CGAACAGATG GCCAACACGC TGTTCAACGA CATGGAGGTC AAGAAGAACC GCGAATGGAT CCTGAACGTG TGGTGA
|
Protein sequence | MNCQFKPKDR IGTINPIFTC QPAGAQYASI GIKDCIGIVH GGQGCVMFVR LLISQHLKES FEIASSSVHE DGAVFGALDR VEEAVDVLLM RYPHVKVVPI ITTCSTEVIG DDVDGVVTKL QEELLDVKYP DREVHLIPIH CPSFVGSMVT GYDVAVRDFV KHFSKKDKPS LKINLITGWV NPGDVKELKH LLAEMQVEAN VLFEIESFDS PLMPDKSAVS HGSTTIADLT ATADAQGTIA LNRYEGGLAA TWLQTRWDVP AVIGSTPIGI RNTDTFLRNV KDMTGKPIPE SLVKERGIAL DAITDLAHMF LADKKVAIYG NPDLVLGLAE YCLDLEMKPV LLLLGDENAG YANDPRIAAL KANVDYPMEI ITNADLFELE ERIKNGLELD LILGHSKGRF VAIDYNIPML RVGFPTYDRA GLYRYPVVGY AGATWLAEQM ANTLFNDMEV KKNREWILNV W
|
| |
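The coordinate and length fields in the record can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon; the function names are illustrative and are not part of any database tooling.

```python
# Values copied from the record above.
START_BP = 1733902
END_BP = 1735287
GENE_LENGTH_BP = 1386
PROTEIN_LENGTH_AA = 461


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; spaces used to group the sequence are ignored."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Both comparisons print True for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the full gene sequence string to `gc_fraction` gives a value that can be compared with the GC content field; the spaces between the 10-base groups do not need to be removed first.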