Gene Franean1_4624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4624 
Symbol 
ID5672969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5514695 
End bp5515615 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content59% 
IMG OID641243485 
Productluciferase family protein 
Protein accessionYP_001508901 
Protein GI158316393 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAACTGG GATTTGCTAT GCCGCATCTG CTGAGACTGA AGGCCACATG TCAACCGTGG 
GAAGCTAAAG TGACGGGTGC GGACCAGACG CGTCTCGCCA AGTGGGCCGA GAAGCTTGGC
TACGCCATGA TAAGCGTGCC CGAACACCAC ATCATTCCGA AGACCCATGT CGATCTTTCG
GGGCCGCACT ACTTAAGTGC GTACCCGACC ATGGCCTATC TGGCCGGGGC CACGGAAAAG
ATACGAGTTA ACTCGTGCAT CGCGCTCCTG CCGTTACAGC ATCCCGCCAT CACCGCCAAG
GCTCTCTCGA GCATCGATTG GCTATCGAGC GGCCGCGTCT CCGTCACGTT CGGGGTGGGC
TGGCTGGAAG AGGAGTTTGA AACTCTAGGC GTTCCCTTCC GTGAACGCGG AGCGATGAGC
GAGGAGTACA TTCAGGCGAT CATCGAGCTC TGGACCAAGG AAGAACCGGC GTTCGAAGGA
AAGTATGTCT CTTTCCGGGA CGTCGCGTTC GAGCCCAAGC CCGTCCAGAA ACCACACCCA
CCGGTGTGGT TCGGTGGTGA CGCCGATGCC GTGCTGAGGC GCACCGCCCG CTACGCTTCG
GGCTGGTGGC CATTCCTCAC CAAACCCGAG GACATCCCCG CGAAGATCGA CTTTGTCAAG
TCGCAGCCCG ACTACAACGG CAAGCTTACT GATGTGTTCT ACGGCTTCGC CACCACGCGA
GTCGGTGACG GTCATGTTAT ACAGAAAGAC CCACGCGCTC GGGCAGGCAT GACCAAACAG
GAGATCATAG ACCGGCTCTG CTGGTTCAAG GAGCTGGGCG TGACGATGAG TTCAGTGCCG
ATCCCCAGCG TCAATCACCT CGAGGACTAC CTCGACTACA CCCAATGGGT GGCGGAAGAG
ATCATGCCCG TGGTTGCGTA G
 
Protein sequence
MKLGFAMPHL LRLKATCQPW EAKVTGADQT RLAKWAEKLG YAMISVPEHH IIPKTHVDLS 
GPHYLSAYPT MAYLAGATEK IRVNSCIALL PLQHPAITAK ALSSIDWLSS GRVSVTFGVG
WLEEEFETLG VPFRERGAMS EEYIQAIIEL WTKEEPAFEG KYVSFRDVAF EPKPVQKPHP
PVWFGGDADA VLRRTARYAS GWWPFLTKPE DIPAKIDFVK SQPDYNGKLT DVFYGFATTR
VGDGHVIQKD PRARAGMTKQ EIIDRLCWFK ELGVTMSSVP IPSVNHLEDY LDYTQWVAEE
IMPVVA