Gene Franean1_0487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0487 
Symbol 
ID5668907 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp570804 
End bp571682 
Gene Length879 bp 
Protein Length292 aa 
Translation table11 
GC content67% 
IMG OID641239417 
Productluciferase family protein 
Protein accessionYP_001504855 
Protein GI158312347 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03619] probable F420-dependent oxidoreductase, Rv2161c family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.372075 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGACT CTATGGTACG AGGCATGGAT ATAGGGATCT TCACCGGTAT CACCGACGAG 
CAGATCAGGC CGGCCCTGCT CGCACGGGCA GTCGAGGAGC GGGGGTTCGA GTCACTGTTC
GTCGCCGAGC ACACCCACAT CCCGGTCCGC CGGGAGACGC CGTATCCCGA AGGTGGCGAC
CTTCCCCGCG ACTACTATCG CACCCTCGAT CCCTTCATAA GCCTGACGAC CGCCGCGGCC
GTGACGACCC GATTGCGACT CGGCACCGCG ATAGCGCTGG TGGTACAGCG GGATCCGATC
CTGTTGGCGA AGGAGACCGC CACCCTCGAC CTGGTCAGCG ACGGCCGATT CGAGCTGGGC
ATCGGCGCCG GCTGGCTGCG CGAGGAGATG CGCAACCACG GCACCGACCC GGAAACCCGG
GTGCCGCTGA TGCGGGAACG GCTGGCCGCG ACGAAAGCGC TCTGGACGTC GGAGCAGGCG
GAGTTCCACG GTCGCTTCGT CGACTTCGAT CCGATCTTCC AATGGCCGAA ACCGGTGCAG
CGGCCGCATC CACCGGTGTG GATCGGAGGC TGGGGTCCGA CCACATTCCA CCGGATCGTC
ACCGACGGCG ACGGCTGGCT CGCTCCTCCC ATACCGGTCG ACGCCTTGGC CCGCGGGGTC
GAGGAACTAG CCGAGGTGGC GAACCGGGCC GGGACGGCCG CACCACCGGT GACCGCGATC
CTGCTGAACC CTGACGAGGC AGCGACCGAG AAAGCTGCGC TCCTCGGTGT CCGTCGCATC
CTGTTCGGGC TGTTTCCCGT CACCGATGCC GACACGACAC TGCGCACTCT GGACCACCTG
GGCACCCTGG CCCAGCGCAC GGTCCTGGCT CGGGGCTAA
 
Protein sequence
MSDSMVRGMD IGIFTGITDE QIRPALLARA VEERGFESLF VAEHTHIPVR RETPYPEGGD 
LPRDYYRTLD PFISLTTAAA VTTRLRLGTA IALVVQRDPI LLAKETATLD LVSDGRFELG
IGAGWLREEM RNHGTDPETR VPLMRERLAA TKALWTSEQA EFHGRFVDFD PIFQWPKPVQ
RPHPPVWIGG WGPTTFHRIV TDGDGWLAPP IPVDALARGV EELAEVANRA GTAAPPVTAI
LLNPDEAATE KAALLGVRRI LFGLFPVTDA DTTLRTLDHL GTLAQRTVLA RG