Gene Franean1_5612 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5612 
Symbol 
ID5673939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6810218 
End bp6811153 
Gene Length936 bp 
Protein Length311 aa 
Translation table11 
GC content71% 
IMG OID641244465 
Productalpha/beta hydrolase fold 
Protein accessionYP_001509869 
Protein GI158317361 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00676484 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00203627 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGCTGAGA TCGTCGCGAA CGGCGTCCGT CTGCATGTGC AGAGGCTGGG GCCCAAGGGT 
GGGGCCGGTC CCGAGTCGCC GGTCGTGGTG ATGGTGCACG GAATGGTGAT GGACAACATC
TCCAGCTTCT ACTTCGCGCT GGGCAACTGC CTGGCCGGAG CGGGATGTGA CGTCATCTGC
TACGACCTGC GCGGCCATGG CCGCAGTGAG CGGACATCGA CCGGCTACAC GATGTCCGAC
TCGATGGCCG ACATCGAGAG CCTGCTCGAC GCGCTCGACG TGCGCCGGCC CGTGCACGTC
GTGGGCAACA GCTACGGCGC GACGCTGACC CTCGCGCTCG GCCTGGCCCA CCCGGAGCGG
GTGGCGAGCC TGACCCTCAT CGAGCCGCCG TTCCTCATCG AGGGCCTCGG CGAGGAGATG
GCGCGCTCAC TGACCCAGGT GCTCGCGGCC GTCACCGACG AGGAGGTGGA GGAGTGGCTG
GAGAACTCCG CCGGCCGCGC GGTCTCCCGG ATCACGCGCG CCTCGCAGGC GCTGCTGAAG
GAGACCACGA TCGCCGAGGA CATGCTGGCG ACCCCGCCGT TCTCGCCGGA GGCGCTGGCC
AGCCTGCCGA TGCCGGTGCT CGCCGTCTAC GGGGCGAACT CGGACATCAT CGACCAGGCC
GAGGGGCTCG CCGAGCTCGT CCCGGACTGC ACCCTGGTCG TCCTGGAGCA GCACACCCAC
ATGGTGCTGC GCGAGGCGGC CGACTACCTG CGCGACCTGC TGCGGTGGTG GCTGTTCCGG
CGGTCCGAGC CGATGCCCTC GCACCAGGTG CGCGGCGCCG GGTTCGACAC TCCCGACTGG
GTGCGGCAGA TGATCCCGCC GCCGAACCTC AACAGTGGGG ACGAGCCCAC GACGTCGCTG
TCCGCGGCGC GGGCCGGTTC GCCGGCCGAG GCCTGA
 
Protein sequence
MAEIVANGVR LHVQRLGPKG GAGPESPVVV MVHGMVMDNI SSFYFALGNC LAGAGCDVIC 
YDLRGHGRSE RTSTGYTMSD SMADIESLLD ALDVRRPVHV VGNSYGATLT LALGLAHPER
VASLTLIEPP FLIEGLGEEM ARSLTQVLAA VTDEEVEEWL ENSAGRAVSR ITRASQALLK
ETTIAEDMLA TPPFSPEALA SLPMPVLAVY GANSDIIDQA EGLAELVPDC TLVVLEQHTH
MVLREAADYL RDLLRWWLFR RSEPMPSHQV RGAGFDTPDW VRQMIPPPNL NSGDEPTTSL
SAARAGSPAE A