Gene RPB_4431 details


Gene Information

Locus tag: RPB_4431
Symbol:
ID: 3912246
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 5021437
End bp: 5022402
Gene Length: 966 bp
Protein Length: 321 aa
Translation table: 11
GC content: 69%
IMG OID: 637886336
Product: coenzyme F420-dependent N(5),N(10)-methenyltetrahydromethanopterin
Protein accession: YP_488028
Protein GI: 86751532
COG category: [C] Energy production and conversion
COG ID: [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases
TIGRFAM ID:
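
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record itself, and the function names are hypothetical.

```python
# Values copied from the gene record above.
START_BP = 5021437
END_BP = 5022402
GENE_LENGTH_BP = 966
PROTEIN_LENGTH_AA = 321


def span_length(start: int, end: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (ASSUMED present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions, the coordinate span gives 966 bp, and 966 / 3 - 1 gives 321 residues, in agreement with the listed gene and protein lengths.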


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAGCA CGCTCGGCCT CTCTTTCGAC GGCGGCGAGA CCTCCGACTC ATTTCGCGCG 
ATGATCGAAC TGGGCGATCG CGGCGGCGCC TCGACGGCCT GGCTCGCCTC GCATTTGTTC
CAGCGCGAAC CGATCTCCTC GGCTGCGATC GCACTCGGCG CCACCAGCCG GATCAGCATC
GCCCTGATGG CGATGAGCCC GTATTCGGTG CATCCGCTCT ACGCCACCAT GGCCGCGGCG
ACGCTGGACG AGTATTTTCC CGGCCGCGTC AAACTTTGCT TCGGCGTCGG CGCGCCGCGC
GATCTCGAAG CTGCGGGCCT CGTCGCCGAG CATCCGCTCG GCACCCTGCG CGAGGCGATC
GCGCTGTCGC GTGCGCTGCT CGGCGGCGAA ACGGTCGATT TCAAAGGTGA GCGCTTCAAG
GTCTCGGGCC GACGGCTGTC GACCGGCGCG CGCGCCGTCC CGATCTATCT GGCCGCCTCG
GGCCCGCAGA TGCTCGAACT CGCCGGCGCC GCCGCCGACG GCGTGCTGAT CAGCGCGGCG
ACCTCGCCGG CTTTCATCCG CTGGACGCTC GATCTCGTCC GCAAGGGCGA AGAGAAGGCC
GGCCGGGTCA TCAAGAAGAC GGCGCTCGTC TATGTTTCGG CCGATGCCGA CGAGACCACC
GCCCGCGACC GCCTGCGCCG CACCCTCGGT TTCATCCTGC GCGGCCAGCA CCATGCCCGC
AATCTCGAAC TCGCGGGCAC GAAGCTCGAC CAGGCCGCGC TCGCCGCGGC CTATGCGCGC
GAAGACTGGG ACGCGGTGAA CGCGCTGGTG ACGGACGACG TGGTGATGCG CCACAGCGCC
AGCGGCACGC CGGAGCAGGT CCGTGCGGCG TTCGCGGCGT ATGAGGATGT CGGCGTCGAC
GAGATCGTGG CGTCCGGCAT GGGCACCCCC GCGGAGCTGC GGCAACTCCT CGAGGCGCTC
GAATAG
 
Protein sequence
MTSTLGLSFD GGETSDSFRA MIELGDRGGA STAWLASHLF QREPISSAAI ALGATSRISI 
ALMAMSPYSV HPLYATMAAA TLDEYFPGRV KLCFGVGAPR DLEAAGLVAE HPLGTLREAI
ALSRALLGGE TVDFKGERFK VSGRRLSTGA RAVPIYLAAS GPQMLELAGA AADGVLISAA
TSPAFIRWTL DLVRKGEEKA GRVIKKTALV YVSADADETT ARDRLRRTLG FILRGQHHAR
NLELAGTKLD QAALAAAYAR EDWDAVNALV TDDVVMRHSA SGTPEQVRAA FAAYEDVGVD
EIVASGMGTP AELRQLLEAL E
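
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, which assigns the same amino acids as translation table 11 for internal codons. Alternative start codons, which table 11 permits, are not handled; this gene begins with ATG, so that does not matter here.

```python
from itertools import product

# Standard codon table in NCBI order (first base varies slowest, bases TCAG).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon; stop codons are rendered as '*'."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, len(s), 3))


# First 30 bases and last 6 bases of the gene sequence listed above.
first_30 = "ATGACCAGCA CGCTCGGCCT CTCTTTCGAC"
last_6 = "GAATAG"

print(translate(first_30))  # MTSTLGLSFD
print(translate(last_6))    # E*
```

The first ten residues and the final residue match the protein sequence listed above, with the trailing TAG acting as the stop codon. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.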