Gene Franean1_5363 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5363 
Symbol 
ID5673697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6467511 
End bp6468956 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content67% 
IMG OID641244221 
Productluciferase family protein 
Protein accessionYP_001509627 
Protein GI158317119 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0431] Predicted flavoprotein
[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03566] FMN reductase, MsuE subfamily 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGTTCG GCGTTTTCAG CATCGGTGAT GTCAAGGCCG ACCCCACCAC GGGCCGGACA 
CCGACCGAGC ACGAGCGGAT CAAGTCGATG GTCGCGACCG CGGTCAAGGC CGAAGAGGTC
GGCTTCGATG TCTTCGCGAC CGGCGAGCAC CACAACCAGC CGTTCGTCTC CTCCTCGCCG
ACCACCCTGC TCAGCCACAT CGCCGCGCGG ACCGGCCGGA TCGTCCTGTC GACGTCGACC
ACGCTGATCA CCACCAACGA CCCAGTGAAG ATCGCCGAGG ACTACGCGGT GCTCCAGCAT
CTCGCCGACG GACGCGTGGA CCTTATGATG GGCCGCGGCA ACACCGGTCC GGTTTACCCC
TGGTTCGGCA AGGATGTCCG CCAGGGCATC CCGCTCGCCA TCGAGAACTA CGCGCTGCTG
CACCAGCTGT GGCACGAGGA GGTCGTCGAC TGGCAGGGCC GCTTCCGCAC CCCCCTCCGG
GGGTTCACCT CAACGCCGCG GCCGCTGGAC GGCATCCCGC CGTTCGTCTG GCACGGCTCC
GTCCGCAGCC CCGAGATCGC CGAACAGGCC GCCTACTACG GCGACGGTTT CTTCGCGAAC
AACCTCTTCG GGCCCAAGGA GCACTTCCAG AAGCTGATCA ATCTCTACCG CGAGCGCTAC
GCCCACTACG GTCACGGCAC CGAGGAACAG GCCATCGTCG GGCTCGGTGG CCACGTCTTC
CTGCGCAGGA ACTCCCAGGA CGCGGTACGC GAGTACCGCC CCTACTTCGA CAACTCCCCC
GTCTACGGCC ACGGCCCGTC GATGGAGCGG TGCATGGAGC AGACCTCGCT CACCGTCGGC
AGCCCGCAGG AAGTCATCGA CAAGACGCTG ACCTTCCGCG ACCACTTCGG CGACTACCAG
CGCCAGCTGT TCAACATCGA TATGGCCGGC CTGCCACTGA AGACCGTCCT CGAACAGCGC
GACCTGGCCG TCGACATCGC CAACAACCTG GTCACCGGCT TCCCCTCGGC TGCCCTGGCG
GAAGCCGTGG AGGCGGTCAC CTCGACGGAC GGTCTGATCG CCGTCACCCC GGTCTTCTCG
GCCTCGTACA GTGGGCTCTT CAAGTCGTTC TTCGACGTCA TCGATAATGA CGCCCTCACC
GGAAAGGCCG TGCTGGCTGC GGCGACCGGC GGGACGGCAC GACACTCGCT CACCCTGGAG
CACGCGCTGC GTCCGCTCTT CGCCTACCTT CGCGCGCTTG TCGTGCCCAC CGCGGTGTAC
GCGGCCTCCG AGGACTGGGG CGGCAGCGGC GATCCCCTTA CCGACACGCT GCCCAACCGG
ATCGTGCGTG CGGCCGGCGA ACTCGCCGGG CTCATGCGCC AACGCGCGGG CGATGCCGCC
CGAACCACCA ACGACACCGC CGTCGTACCG TTCGAACAGC AATTGGACGC ACTGCGACCC
GCGTGA
 
Protein sequence
MQFGVFSIGD VKADPTTGRT PTEHERIKSM VATAVKAEEV GFDVFATGEH HNQPFVSSSP 
TTLLSHIAAR TGRIVLSTST TLITTNDPVK IAEDYAVLQH LADGRVDLMM GRGNTGPVYP
WFGKDVRQGI PLAIENYALL HQLWHEEVVD WQGRFRTPLR GFTSTPRPLD GIPPFVWHGS
VRSPEIAEQA AYYGDGFFAN NLFGPKEHFQ KLINLYRERY AHYGHGTEEQ AIVGLGGHVF
LRRNSQDAVR EYRPYFDNSP VYGHGPSMER CMEQTSLTVG SPQEVIDKTL TFRDHFGDYQ
RQLFNIDMAG LPLKTVLEQR DLAVDIANNL VTGFPSAALA EAVEAVTSTD GLIAVTPVFS
ASYSGLFKSF FDVIDNDALT GKAVLAAATG GTARHSLTLE HALRPLFAYL RALVVPTAVY
AASEDWGGSG DPLTDTLPNR IVRAAGELAG LMRQRAGDAA RTTNDTAVVP FEQQLDALRP
A