Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4431 |
Symbol | |
ID | 3912246 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 5021437 |
End bp | 5022402 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637886336 |
Product | coenzyme F420-dependent N(5),N(10)-methenyltetrahydromethanopterin |
Protein accession | YP_488028 |
Protein GI | 86751532 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGCA CGCTCGGCCT CTCTTTCGAC GGCGGCGAGA CCTCCGACTC ATTTCGCGCG ATGATCGAAC TGGGCGATCG CGGCGGCGCC TCGACGGCCT GGCTCGCCTC GCATTTGTTC CAGCGCGAAC CGATCTCCTC GGCTGCGATC GCACTCGGCG CCACCAGCCG GATCAGCATC GCCCTGATGG CGATGAGCCC GTATTCGGTG CATCCGCTCT ACGCCACCAT GGCCGCGGCG ACGCTGGACG AGTATTTTCC CGGCCGCGTC AAACTTTGCT TCGGCGTCGG CGCGCCGCGC GATCTCGAAG CTGCGGGCCT CGTCGCCGAG CATCCGCTCG GCACCCTGCG CGAGGCGATC GCGCTGTCGC GTGCGCTGCT CGGCGGCGAA ACGGTCGATT TCAAAGGTGA GCGCTTCAAG GTCTCGGGCC GACGGCTGTC GACCGGCGCG CGCGCCGTCC CGATCTATCT GGCCGCCTCG GGCCCGCAGA TGCTCGAACT CGCCGGCGCC GCCGCCGACG GCGTGCTGAT CAGCGCGGCG ACCTCGCCGG CTTTCATCCG CTGGACGCTC GATCTCGTCC GCAAGGGCGA AGAGAAGGCC GGCCGGGTCA TCAAGAAGAC GGCGCTCGTC TATGTTTCGG CCGATGCCGA CGAGACCACC GCCCGCGACC GCCTGCGCCG CACCCTCGGT TTCATCCTGC GCGGCCAGCA CCATGCCCGC AATCTCGAAC TCGCGGGCAC GAAGCTCGAC CAGGCCGCGC TCGCCGCGGC CTATGCGCGC GAAGACTGGG ACGCGGTGAA CGCGCTGGTG ACGGACGACG TGGTGATGCG CCACAGCGCC AGCGGCACGC CGGAGCAGGT CCGTGCGGCG TTCGCGGCGT ATGAGGATGT CGGCGTCGAC GAGATCGTGG CGTCCGGCAT GGGCACCCCC GCGGAGCTGC GGCAACTCCT CGAGGCGCTC GAATAG
|
Protein sequence | MTSTLGLSFD GGETSDSFRA MIELGDRGGA STAWLASHLF QREPISSAAI ALGATSRISI ALMAMSPYSV HPLYATMAAA TLDEYFPGRV KLCFGVGAPR DLEAAGLVAE HPLGTLREAI ALSRALLGGE TVDFKGERFK VSGRRLSTGA RAVPIYLAAS GPQMLELAGA AADGVLISAA TSPAFIRWTL DLVRKGEEKA GRVIKKTALV YVSADADETT ARDRLRRTLG FILRGQHHAR NLELAGTKLD QAALAAAYAR EDWDAVNALV TDDVVMRHSA SGTPEQVRAA FAAYEDVGVD EIVASGMGTP AELRQLLEAL E
|
| |