Gene Franean1_1176 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1176 
Symbol 
ID5669589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1400042 
End bp1401055 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content75% 
IMG OID641240108 
ProductHAD family hydrolase 
Protein accessionYP_001505536 
Protein GI158313028 
COG category[R] General function prediction only 
COG ID[COG1011] Predicted hydrolase (HAD superfamily) 
TIGRFAM ID[TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00246579 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.129694 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCTTCT CCTGGTTCGC CCCGGACGCG GTGCCGGCCG TCGACGGGAC ACCCCTCCCG 
GACGACCGCA AGCCGCTCGC GGCCGTGGTG TTCGACTGGG GCGGGACGCT CACCCTCTTC
CACGACGTCG ACCTGCTCGA CCTGTGGCGG ATGACCGCGC TGGAGATCTC GTCCGCACAC
GCCGACGAGA TCACCGCGAT CCTGGCTGGC CTGGAGACGA CCTGGTGGCA TGCCCGGGAC
GGCAGCGACG GCCTGGCCGG CAACGGCGCG GCGGGCGGTA TGACCGGGTC GGTCACCGAA
CTGCTTGCGG CCGCCTCCGC GGCCCTCGGC GTGGACGTGG CCGCGGCCGT GCTGCGCGCG
GTCAGCCGGC GCGACCTCGC AGCCTGGACC CCGCACACGG TGTGCGATCC CGAGGCACTG
CTGCTCGTGC ACCTCGTGCG TGCCCGCGGC CTGCTCGTCG GCCTGCTCAC CAACACCCGC
TGGCCACGGG CCTGGCACGA GCGCCTACTC GAACGCGACG GCCTGCTCGA CCTCTTCCAC
GCCAGGATCT ACACGAGCGA CCTCCCTTTC GACAAGCCAC ACCCGATCGC CTTCCAGGCG
GTGCTGGCCG CACTGGGCGT CACCGACCCG TCCCGGGTGC TCTTCGTCGG CGACCGGCTG
CGGACGGACA TCCGCGGCGC GCGGGCGGCC GGCATGCGGG CTGTGCTGGT CGCGGACGGT
GGCTTCGCGG ACGGCGGATT GGCGGACGGC GCCGCGGCGG GCCTCGGCGG TGGTCCCGGC
GGCGTGGTCC TCGATCCGCG TGATCCCCTT GGGCTGGCGG TGGCGGGCCG CGATCTCGCA
GGGCGGGCCG ACCTGGGCCG AGACCCCGCC GCGCGGGATC TGCCCGGTGC GGACGGGGCC
GGGGCGTCCG CGTCGATGCT CGCGGATGCG GTGATCGGCC GCCTCGGCCA CCTGCTCGAG
GTCCTCGACC TGCTCGGCGC CCCCGGGCCT GGGTCACCCG CCCCACGTCG CTGA
 
Protein sequence
MGFSWFAPDA VPAVDGTPLP DDRKPLAAVV FDWGGTLTLF HDVDLLDLWR MTALEISSAH 
ADEITAILAG LETTWWHARD GSDGLAGNGA AGGMTGSVTE LLAAASAALG VDVAAAVLRA
VSRRDLAAWT PHTVCDPEAL LLVHLVRARG LLVGLLTNTR WPRAWHERLL ERDGLLDLFH
ARIYTSDLPF DKPHPIAFQA VLAALGVTDP SRVLFVGDRL RTDIRGARAA GMRAVLVADG
GFADGGLADG AAAGLGGGPG GVVLDPRDPL GLAVAGRDLA GRADLGRDPA ARDLPGADGA
GASASMLADA VIGRLGHLLE VLDLLGAPGP GSPAPRR