Gene Franean1_1749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1749 
Symbol 
ID5670151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2098217 
End bp2099317 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content77% 
IMG OID641240670 
ProductHAD family hydrolase 
Protein accessionYP_001506093 
Protein GI158313585 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0647] Predicted sugar phosphatases of the HAD superfamily 
TIGRFAM ID[TIGR01457] HAD-superfamily subfamily IIA hydrolase, TIGR01457
[TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0343675 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000628636 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACGGTCC CCCGAGAGGC GGCGGGCACG GCGTGGTCCG TGCTCGCCGG AACGAGCCGG 
CCACTGGTGG CGATGTTCGA CGTCGCGTTG ATGGATCTCG ACGGGGTGGT GAACCGCGGC
GAGCGGGCTG TGCCGCACGC CGCCGCGGCC ATCGAGGCCG CGGGCCGGCA GGGGATGCGC
ACCGTCTACG TGACGAACAA CGCGCTGCGG ACCCCGGAGA CCGTCGCCGC GCGGCTGACG
GGTTTCGGCG TGCCAGCCGA ACCGCCGGAG GTCGTCACCT CGGCGCAGGC GGCGGCGCAC
GTGCTCGCCG AACGGCTACC GGCCGGGGCG GTGGTCCTGG TCGCCGGAGG TGTCGGGCTC
CGGGAGGCGG TGCGCGCGGA GGGCCTGGTC CCGACCGGGT CGGCCGCGGA CGAGCCGGCC
GCCGTGGTCC AGGGCTTCGA TCCGGAGATC AACTATGCCC GGCTGGCCGA GGCGGTGCTG
GCGATCCGGG CGGGGGCGTG GTGGGTCGCG AGCAACACCG ACCTGACGGT GCCGACGGAG
CGTGGCCTGG CGCCCGGTAA CGGGGCGCTG GTGGCCTTCG TTCGGGCCGC GACGGGCGCG
GAGCCCGAGG TGACCGGGAA ACCGGAGTTC GCGATGCACG CGGAGTCGGT GCGGCGCAGC
GGCGCGCGTG ATCCGATCAT CGTCGGCGAC CGGCTGGACA CCGACATCGA GGCGGGTTTC
CGTGCCGGCA CGCCGACTCT GCTGGTGTTC ACCGGTGTCA CCGGGCCCGC GGAGCTGCTC
GGCGCGCCCG CGCGGCACCG GCCGACCTTC CTCGCCGCCG ACCTGCGCGG GCTGCTGCGC
CCCCAGCCCG CCGCGCTCGC CCGGGACGGC TCATCCCGGT GCGGCGGGTG GACGTGCGAC
CTGGACGGCG GCACCCTGCG CTGGCACCAG GCCGACCCCG GGAGCGCCGG GCTGGACGAC
GGGCCGGACG ACGGGCTGGA CGCGCTGCGG GCCGCGTGCG CGCTGGTCTG GGCGGCAGCG
GACGAGGGCC GTCCGGTCGA GGCCCTGGCC ACTGATCGGC CGCCGGGCTG CGAGGACCTG
CGCGCGCCGG CCGCGCGCTG A
 
Protein sequence
MTVPREAAGT AWSVLAGTSR PLVAMFDVAL MDLDGVVNRG ERAVPHAAAA IEAAGRQGMR 
TVYVTNNALR TPETVAARLT GFGVPAEPPE VVTSAQAAAH VLAERLPAGA VVLVAGGVGL
REAVRAEGLV PTGSAADEPA AVVQGFDPEI NYARLAEAVL AIRAGAWWVA SNTDLTVPTE
RGLAPGNGAL VAFVRAATGA EPEVTGKPEF AMHAESVRRS GARDPIIVGD RLDTDIEAGF
RAGTPTLLVF TGVTGPAELL GAPARHRPTF LAADLRGLLR PQPAALARDG SSRCGGWTCD
LDGGTLRWHQ ADPGSAGLDD GPDDGLDALR AACALVWAAA DEGRPVEALA TDRPPGCEDL
RAPAAR