Gene Franean1_3942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3942 
Symbol 
ID5672303 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4711200 
End bp4712195 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content75% 
IMG OID641242821 
Productferredoxin 
Protein accessionYP_001508238 
Protein GI158315730 
COG category[C] Energy production and conversion 
COG ID[COG1018] Flavodoxin reductases (ferredoxin-NADPH reductases) family 1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.51179 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0171714 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCTCGT CCCCGTCCAC CGTGGATCGA GCCGAACAGC GGCAGACGCT GACGCTGACG 
GTCCGCGCCC GCCGCCACGT CGCCGAGGAT GTCGTCTGTT TCGATCTGGC CGATCCGATC
GGCGCCGCAT TGCCGCCCTG GACACCCGGC GCGCATGTCG ACGTCACGGT GCGGCCGGGG
ACGGTGCGGC AGTACTCGCT GTGCGGTGAT CCAGCGGATC GCCACCATTG GCGGATCGCG
GTGCTGCGGG AGGCCGCCGG CCGCGGCGGT TCGGTGCACC TGCACGACCG GGTCGGTGCC
GGCGCGTTGC TGCCGGTAGG GCAGCCGCGC AACGCGTTCC CCCTGGTCGC CGCGCCGCGC
TACCTGCTGG TCGCCGGCGG GATCGGCGTC ACCCCGCTGC TGCCGATGAT CGACGAGCTC
GCCGCGCGTG GCGCCGAGTG GCGGCTGCTC TACGGCGGGC GCCACCGCGC GGCGATGGCC
TTCGCCGACG ACCTCGCCCG CCACGGCGAC CGGGTCGTCC TGCACCCGCA GGACACCCAC
GGGCTGCTTC CCCTCGGCCC CGTCCTGGAC GGCCTGCGTG CCTCCGGCGA GCACGAGGAG
ACGGCGGTCT ACTGCTGCGG GCCCGAGGGT CTGCTCGGGG CCATCGAGGG GCACTGCGCG
CAGTGGCCCG CCGGCGCCCT GCACGTCGAG CGGTTCCACC CCGCAGAGCC CGCCCACCGC
GACACCGACG GCGCCTTCGA GCTGTGCCTG GCCCGTAGCG GGCGGGTGCT GCGGGTCGGG
CCCGGGCAGT CGGTCCTGGA GGTGCTGGAG GCGGCCGGGG CCGCCGTCAC CTCCTCCTGC
CGGGACGGTA CGTGCGGCAC CTGCGAGACG CCGGTGGTCG AGGGCGGTGT CGACCACCGT
GACACCGTCC TGACCCCGGC CGAGCGCGAC GGCGGCCGGA CGATGATGGT CTGCGTCTCG
CGTGGCCTGG GCGGACGTCT CGTCCTGGAC ATCTGA
 
Protein sequence
MSSSPSTVDR AEQRQTLTLT VRARRHVAED VVCFDLADPI GAALPPWTPG AHVDVTVRPG 
TVRQYSLCGD PADRHHWRIA VLREAAGRGG SVHLHDRVGA GALLPVGQPR NAFPLVAAPR
YLLVAGGIGV TPLLPMIDEL AARGAEWRLL YGGRHRAAMA FADDLARHGD RVVLHPQDTH
GLLPLGPVLD GLRASGEHEE TAVYCCGPEG LLGAIEGHCA QWPAGALHVE RFHPAEPAHR
DTDGAFELCL ARSGRVLRVG PGQSVLEVLE AAGAAVTSSC RDGTCGTCET PVVEGGVDHR
DTVLTPAERD GGRTMMVCVS RGLGGRLVLD I