Gene Franean1_3959 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3959 
Symbol 
ID5672320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4739732 
End bp4740967 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content71% 
IMG OID641242838 
Productmonooxygenase FAD-binding 
Protein accessionYP_001508255 
Protein GI158315747 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.442261 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0366683 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACGACG TCATCGTGGT CGGCGCGCGC TGCGCCGGCT CACCGCTGGC CATGCTGCTC 
GCCCGCCAGG GCCACCGGGT CCTGGTCGTG GACAGGTCGA CCTTCCCCAG CGACACCGTG
TCCACCCACT ACATGCACCA GACCGGCCTG GCCCGGTTGC GGGACTGGGG TCTGCTGGAC
CGCCTGGTCG CCACCGGCGT CCCGCCGATG CGCCACCTCA CCTTCTCCTA CACCGGCCTG
CACCTCGAGG GCTTCGCCGA CCCGATCGAC GGGATCACGG AGGTCTACTC ACCGCGTCGG
ATCATCCTCG ACAAGCTGCT GGTGGACGCC GCCCGCGAGG CCGGCGCCGA GGTGATCGAG
GGCTTCACGG TCAGCGACCT GCTCTTCGAC GATGGCCGGG TCACCGGCAT CCGCGGCCGG
ACCGGCGAGG GACCCGAGCA GGAGTTCCGC GCCGCGTTCG TGGTCGGGGC GGACGGGCGG
ACGTCCACCG TGGCCGACAA GGTCGGCGCG GACTTCTACC GGGTCGTGCC GGCGGCCGGC
TTCATCTACT ACTCGTACTT CGAGGGCCTC GACTGGACGT TCCAGCACCG GACCGGCTTC
GGGGAGCAGC AGTTCGGCGC CTGGCCCACG CACGACGGCC GGCACCTGGT GTCGATCATC
CGCCCCCGCT CGGCGTTCAG CGAGTTCCGC GCCGACGTCG AGGGCAGCTT CCAGGCCATC
TTCGACGCGG TCGTCCCCGA GCTCGGCGAG GACCTGCGGA CCCGCGGCCG CCGCGTCGAG
GAGTTCCGTC CGATGCGCTA CCCGGACAAC TACTACCGGC GCTCGCACGG GCCCGGCTGG
GCGCTGGTCG GCGACGCCGG CTACCACAAG GACCCGTTCA CCGGCTGGGG TATCACCGAC
GCGTTCCTCC AGGCGCAGAC GCTGGCGGAC CGGCTGCATT CCGGCCTCGC CGGCGAGCGG
ACGCTGGACG ACGCCGCCGC CGAGTACGTC AAGATCCGCG ACGAGGAGAG CCACGGGACG
TTCGAGCTGA CCTGCACGCT CTCCCACCTC GTGCTGCCGC CGTTCCTGCA CTCGGCCTTC
GCCGCGACGG CGCAGAGCCC CCGCTACACG AAGAAGTTCT TCGGGTTGAT CGCCGGTGGC
GTTCCCGGCC ACGACTTCTT CCACCCCGAC AACCTCGCGG AGCTCTACGA GGAGGTCGGC
ATGCCCGCCG AGAAGCGCCT GCTGTCGGCC AGCTGA
 
Protein sequence
MYDVIVVGAR CAGSPLAMLL ARQGHRVLVV DRSTFPSDTV STHYMHQTGL ARLRDWGLLD 
RLVATGVPPM RHLTFSYTGL HLEGFADPID GITEVYSPRR IILDKLLVDA AREAGAEVIE
GFTVSDLLFD DGRVTGIRGR TGEGPEQEFR AAFVVGADGR TSTVADKVGA DFYRVVPAAG
FIYYSYFEGL DWTFQHRTGF GEQQFGAWPT HDGRHLVSII RPRSAFSEFR ADVEGSFQAI
FDAVVPELGE DLRTRGRRVE EFRPMRYPDN YYRRSHGPGW ALVGDAGYHK DPFTGWGITD
AFLQAQTLAD RLHSGLAGER TLDDAAAEYV KIRDEESHGT FELTCTLSHL VLPPFLHSAF
AATAQSPRYT KKFFGLIAGG VPGHDFFHPD NLAELYEEVG MPAEKRLLSA S