Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ppha_1955 |
Symbol | |
ID | 6463074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pelodictyon phaeoclathratiforme BU-1 |
Kingdom | Bacteria |
Replicon accession | NC_011060 |
Strand | + |
Start bp | 2042975 |
End bp | 2044327 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642728158 |
Product | Nitrogenase |
Protein accession | YP_002018788 |
Protein GI | 194336994 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0171503 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACACG CAAAAACAGC AACACAGAAT GCCTGCAAAC TCTGCAACCC GCTTGGAGCA TGTCTTGCCT TCCGGGGCAT AGAGAATTGC GTACCGTTCC TGCACGGTTC ACAGGGGTGT GCCACCTATA TACGGCGGTA CCTGATCAGC CATTACAAAG AGCCGATCGA TATTGCCTCA TCGAACTTTC ATGAAGAAAC CGCCGTCTTC GGCGGCAGCC ATAACCTGAA AATCGGACTG AAAAACGTCT CTGATCAGTA CAAACCAGAG GTTATCGGGG TGGCAACCAC CTGCCTGAGC GAAACCATCG GCGATGATGT GCCCGGAATT TTGCGGGAAT ATAAAAAGGA GTTCAAAAAC GGCACACCAA TGCCGATACT GATTCACGCC TCAACGCCGA GCTACCAGGG CAGTCACATT GACGGTTTTC ATGCCGCAGT CAGGGCGACC GTCAAAACGC TGGCCGAGAA GGGCGAAGAG CAGAGCCTGA TCAACCTCTT TCCGAACATG GTCTCGCCAG CCGACCTGCG CTATCTCAAG GAGATCTTCG CTGATTTCAA GGCTCCGGTG ATGTTGCTGC CCGACTATTC CCAGACCATG GATGGCGGCC CCTGGGGCGA ATATCATCGC ATACCGCCTG GCGGCACTCC GGCAAGCGCT ATTGTATCGG CAGGAAGCGC CGCAGCAAGC ATTGAGTTCG GTTCAACCCT TGAAGCATCG AAATCGGCTG CCGGCTATCT TGAAGAGGCG TTTGATGTGC CAAGATATCA TCTCTCCCTG CCCATCGGCA TCAAGGAGAG CGACAAATTT TTCAGCCTGC TTGAAACACT GACCGGCAAG GCTCGGCCCG ATAAATATGA CGATGAACGA CGCAGGCTGA TTGACGCCTA TGCCGACGGC CATAAATATG TTTTCGAGAA AAAGGTAATT CTCTACGGTG AAGAAGACCT TGTGGTTGCC ATGACCGCAT TTCTCACTGA GATCGGTATG ACGCCCCTCC TCTGTGCTTC GGGAGGAAAA AGTGGTCTTC TAAAAAAAAG GATCAGGGAG CTGATCCCCA CAATGGATGA ACTCGGTATC AAGGTACGTG AAGGGGTTGA CTTTGTCGAT ATCGAGGATG AAGCCAAAAT ACTGAAACCC GATTTTCTTA TCGGTAACAG CAAGGGCTAT ACCATGTCAA GAAAAAACAA CATCCCCTTG CTTCGGCTCG GTTTCCCCAT TCACGACCGT TTCGGAGGAC AAAGAATGCA CCATCTTGGG TACAGGGGAA CCCAGGAACT CTTTGACCGG ATCGTCAACA CCGTTATCGA AGAGCGGCAG AATGCTTCAT CAATCGGTTA CACTTATATG TAA
|
Protein sequence | MKHAKTATQN ACKLCNPLGA CLAFRGIENC VPFLHGSQGC ATYIRRYLIS HYKEPIDIAS SNFHEETAVF GGSHNLKIGL KNVSDQYKPE VIGVATTCLS ETIGDDVPGI LREYKKEFKN GTPMPILIHA STPSYQGSHI DGFHAAVRAT VKTLAEKGEE QSLINLFPNM VSPADLRYLK EIFADFKAPV MLLPDYSQTM DGGPWGEYHR IPPGGTPASA IVSAGSAAAS IEFGSTLEAS KSAAGYLEEA FDVPRYHLSL PIGIKESDKF FSLLETLTGK ARPDKYDDER RRLIDAYADG HKYVFEKKVI LYGEEDLVVA MTAFLTEIGM TPLLCASGGK SGLLKKRIRE LIPTMDELGI KVREGVDFVD IEDEAKILKP DFLIGNSKGY TMSRKNNIPL LRLGFPIHDR FGGQRMHHLG YRGTQELFDR IVNTVIEERQ NASSIGYTYM
|
| |