Gene Franean1_4154 details


Gene Information

Locus tag: Franean1_4154
Symbol:
ID: 5672509
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4934662
End bp: 4935567
Gene Length: 906 bp
Protein Length: 301 aa
Translation table: 11
GC content: 74%
IMG OID: 641243027
Product: hypothetical protein
Protein accession: YP_001508444
Protein GI: 158315936
COG category: [C] Energy production and conversion
COG ID: [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases
TIGRFAM ID: [TIGR03620] probable F420-dependent oxidoreductase, MSMEG_4141 family
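
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. Both assumptions agree with the numbers in this record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4934662, 4935567)
print(length)                     # 906
print(protein_length_aa(length))  # 301
```

Both results match the Gene Length and Protein Length fields listed for Franean1_4154.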


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.238248
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.605436
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGGTAG CGGAGACCCG GCGGCGGCTG GGCAGGTTCG GGGTGTGGGT GGCCCCGTTC 
TCCTTGCTCG AGACGTCGGT GGCGGTGCAG CGCAGACAGT TCGCCCGGAT CGAACATCTC
GGGTACGGCT CCCTGTGGAG TGGGGAGACG CCGCCGGGTG CGCCGGTCGG GGGCCGGGAG
GTGTTCACCC AGCACGGGTT GATGCTCGCC GCGACCGAGC GGATCGTCGT CGGTACGGGC
ATCGCGAACA TCAGCACCCG CACGGCGGGC GCGATGCACA CCGGTGCCGC GACGCTGGCC
GAGGCGTATC CCGGCCGGTT CGTGCTCGGT CTGGGCGGCC AGTCCGGTGA CCGGCCCCTC
ACCCGTCTAC GGGAGTATCT CGACGCGATG GACCACGCCG CGCGGGCGCT GGCGCAGCTG
CCGGCTCCGG CCTATCCGCG TGTTCTCGCC GCGCTCGGGC CACGCGCCCA CGGGCTGGCT
TCCGATCGCG CCGACGGCGT GCACCCGTTC CTGCAACCGG TGGCGCACAC GGCGGCGGCC
AGGGCGGCCG TGGGCCCGGA CCGTCTGGTG ATCCCCCACC AGGCCGTCGT ACTCGAAACG
GACGCGGACG CGGCCCGGGC GCGGCTGCGG GCGATTTTCG CTCTGGGGGT GGGCGCCTCG
GCCTCGCCTT ACACCGCGCA CTACCGGCGG CTCGGCTACA GCGAGGCGGA CCTGGCCGGG
CAGCGCAGTG ACCGGCTGGT CGACGACGTC CTGGCCTGGG GCGACGAGGC TGCTGTCGCG
GCCCGGCTGA TCGCGCATCT CGACGCCGGT GCCGATCATG TGCTTGTGCA CCCGTTCGCG
GCGGACCTTC CCGCGGCGGT CGACCAGCTC GAACGGCTCG CTCCCCTGTT GCGCGACGCA
GCCTGA
 
Protein sequence
MAVAETRRRL GRFGVWVAPF SLLETSVAVQ RRQFARIEHL GYGSLWSGET PPGAPVGGRE 
VFTQHGLMLA ATERIVVGTG IANISTRTAG AMHTGAATLA EAYPGRFVLG LGGQSGDRPL
TRLREYLDAM DHAARALAQL PAPAYPRVLA ALGPRAHGLA SDRADGVHPF LQPVAHTAAA
RAAVGPDRLV IPHQAVVLET DADAARARLR AIFALGVGAS ASPYTAHYRR LGYSEADLAG
QRSDRLVDDV LAWGDEAAVA ARLIAHLDAG ADHVLVHPFA ADLPAAVDQL ERLAPLLRDA
A
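
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: `translate_cds` and `gc_fraction` are hypothetical helper names. It assumes that translation table 11 uses the standard codon assignments for elongation and that an alternative initiation codon such as GTG is read as methionine when it is the first codon. Spaces and line breaks in the 10-base blocks shown above are ignored.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons assumed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(dna: str) -> str:
    """Remove whitespace from a block-formatted sequence and upper-case it."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a complete CDS; a recognised start codon is read as M."""
    seq = clean(dna)
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"   # initiator methionine, whatever the codon normally encodes
    if protein and protein[-1] == "*":
        protein.pop()      # drop the terminating stop codon
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence above, in the same block layout.
print(translate_cds("GTGGCGGTAG CGGAGACCCG GCGGCGGCTG"))  # MAVAETRRRL
```

Passing the full gene sequence text to `translate_cds` gives a string that can be compared with the protein sequence above once its spaces are removed, and `gc_fraction` on the same text can be compared with the reported GC content.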