Gene Franean1_3381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3381 
Symbol 
ID5671752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4006202 
End bp4007215 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content69% 
IMG OID641242269 
Productluciferase family protein 
Protein accessionYP_001507689 
Protein GI158315181 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03560] probable F420-dependent oxidoreductase, Rv1855c family
[TIGR03621] probable F420-dependent oxidoreductase, MSMEG_2516 family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGAGC CGAGCGAGGC TCATCCGTCA CCGGCGCGGC CGATCCGGTT CAACACCGGC 
CCGGGCCGGA TCTCCGACCT CGCGGCGCTC CGGGAGGCTG GGCAGGCCAT CGAGGGTCTC
GGCTACTCGA CCTTCGCTCT CGCCGACCAT TTCATGATCC GGTACGCTCC GCTGATCGCG
CTCCAGGCGG TCGCCGACGC GACCAGCACG CTGCGGCTGA CCCAGACTGT CCTCAACCAG
GATCTACGGC ATCCCGCCGT CCTCGCCAAG GAACTCGCCA CCCTGGACGT GCTGTCCCAG
GGGCGTCTGC AGGTGGGGCT CGGCGCGGGA TGGATGCAGG CCGAGTACCA ACAGGCCGGC
ATCCGGTACG ACCCGGCCGC CGCGCGGATC GCGCGGCTCG AGGAAGTGGT CATCATCCTG
AAAGGCCTGT TCGGAGATGA TCCGTTCAGC TACTCAGGCG CGAACTTCAC GATTGATGCT
CTTCGTGGCA CCCCGCGGCC TCTGCAGCGT CCGCACCCGC CGATCATGAT CGGCGGCGGT
GGCCGCAAGC TGCTCTCGGT CGCCGGGCGC CATGCCGACA TCGTGCAGAT CATGCCCCGG
CTTCCGCAGG AGGTCCGGCC GGCCGAACCG CACCCGTTCA GCGGCGAGGC CTACGAGGAG
AGAATCGGCT GGGTCCGCGC TGCCGCCGGG GACCGCTTCG GCGACATCGA GCTGGGAGCC
CAACTGCTGA ACGTGACGAT CACCGATGAT CCGGAAGCGG CGTTTGAGGC CTGCTTTCAG
AGCTTTGGCC GGCAGGTCCG AGGATCGTCC GGGGGCGCCG TCCCGTCGCG AGCGGACCTC
GGCTCGTCGC CGATGGTGGC CATCGGTTCG CTGGACGACG TCTGCCGGAA AATCCTGGAC
ATCCGTGACC GGTTCGGGAT CAGTTACTTC ACAACGCCGC TCGGTGCGAG CCCCGAATCC
TTCGCACCGG TCGTGGAACG GCTGGCGGAC GCGCCAGCCG GCGCCGCGGC GTGA
 
Protein sequence
MGEPSEAHPS PARPIRFNTG PGRISDLAAL REAGQAIEGL GYSTFALADH FMIRYAPLIA 
LQAVADATST LRLTQTVLNQ DLRHPAVLAK ELATLDVLSQ GRLQVGLGAG WMQAEYQQAG
IRYDPAAARI ARLEEVVIIL KGLFGDDPFS YSGANFTIDA LRGTPRPLQR PHPPIMIGGG
GRKLLSVAGR HADIVQIMPR LPQEVRPAEP HPFSGEAYEE RIGWVRAAAG DRFGDIELGA
QLLNVTITDD PEAAFEACFQ SFGRQVRGSS GGAVPSRADL GSSPMVAIGS LDDVCRKILD
IRDRFGISYF TTPLGASPES FAPVVERLAD APAGAAA