Gene Franean1_6431 details

Gene Information

Locus tag: Franean1_6431
Symbol:
ID: 5674746
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7811205
End bp: 7813370
Gene Length: 2166 bp
Protein Length: 721 aa
Translation table: 11
GC content: 75%
IMG OID: 641245279
Product: FkbH like protein
Protein accession: YP_001510674
Protein GI: 158318166
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3882] Predicted enzyme involved in methoxymalonyl-ACP biosynthesis
TIGRFAM ID: [TIGR01681] HAD-superfamily phosphatase, subfamily IIIC
            [TIGR01686] FkbH-like domain
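
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon, which is consistent with the reported 2166 bp and 721 aa. The function names are illustrative and are not part of the database.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 7811205
END_BP = 7813370


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of a CDS whose last codon is a stop codon (assumption)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2166
print(protein_length(length_bp))  # 721
```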


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0351897
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0864936
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGATC CCGCGGGGAC GGGCCGGCAC CGGCTCGCCG AGCTGACGGC CGCCGCGACC 
GCGGCCGAGG TGCCGCCGGC CGGGCTCTGT CGTGATCTCG CGCTCGCCTA CGAGAAGGCG
GGCGATCCGG CCGCCGCCGT GCGCTGGGCA CTGGCCACGA CCGACGCCGG AGGCGACCTG
ACCTGCTGGA CGGTCGCCGC CGGCGTCCTG CGCCGCGTCT CGGCCGGGGC GGTCGCGTCA
GCCGTGTCAC CCGGGGCGGC TGCGTCGGTC GGAGCGGCTG TGCCAGCCCG GGCGGCCGTG
GCCGCCGCTG CGGGAGGGGC GGTCATGCCG TCGCCCGTCG CGGGCCACAC GGCGCGCTCG
GCGCGGGTCG CCGTGCTCGG CAGCTACACG ACCGCCCAGC TCGCGGCGCT CCTGCCGCTC
GCAGCCGCGC GCGCCGGGGT GGCCGTCGAG GTCTACGAGT GCGGGTACGG CCAGTACCGG
CAGGAGGTGC TCGACCCGGG CAGCGGCCTG TACCGCTTCG GGCCGGACGT CGTGATCCTC
GCGGTCCACG AGGGCGAGGC CGCGCTTCCA GCCCTCGCCG AGGACCCCGA GCGCGCGGTC
GCCGCCGAAG TCGGCCGCTG GACATCGCTG TGGGAGATCA TCATCTCCCG GTTGGATGCC
CGCGTCATCC AGCACACCTT CGTCGTCCCG GAGACGGAGG CGCTCGGGCA CCTGGCCCTG
CGCATCCCCG GGTCGCGGTC GTCGATGCTC GCCGCGGTCA ACGCGGCGCT CGGCCGCGCG
GCGGCGGGCG GTCAGGTCGC CTTCGTCGAC TGTGAGCGGC TGGCCGCCAC GGTCGGCAAA
CGATCCTGGT TCGACCCGCG GTACTGGCAC CGCGCGAAGC AGGCCGTCTC GCTCGCGCAC
GTGCCCGCGC TGGCCCGTCA CACCGCGGCG GTGCTCGGCG CGCAGCTGGG CACCAGCCGT
AAATGCCTGG TGCTCGACCT CGACAACACG CTGTGGGGCG GGGTGCTCGG GGAGGAGGGC
CTCGCCGGCA TCGCGCTCGG GGACGGGCCG GTCGGCGAGG CTTTCTCCGC GTTCCAGGAG
TACATCGGCC GGCTCCGTGC CCGCGGCGTG ATCCTCGCCG TCTGCTCGAA GAACAACGAG
GCGGACGCGC GCGAGGCGTT CGAGCGGCAC CCGGCGATGC GGCTGCGGCT GGACGACATC
GCCATGTTCA GCGCCTCCTG GGAGGACAAG CCGACCCAGA TCCGGCGCAT CGCGAGCACC
CTCGGGATCG GTCTGGACTC CCTGGTCTTC GTCGACGACA ACCCGGCCGA GCGCGAGGTG
GTCCGCCAAC TCGTGCCCGA GGTGGACGTC ATCGCCCTGC CGCGGGACCC GCACGGCTAC
GTGCGCGCGG TCGCCGACTA CCCGTTCTTC GAACCGGCCG CCCTGACCAC CGAGGACACC
GCCCGCACGG AGCAGTACCG GGCGCGGGCC CGGGCGGCGG AGCTCGCCGC GTCCGCGACC
TCGCTGGAGG AATTCCACCG CGGGCTGGAG ATGGTCGCGA CCGTCGTACC CCTCGACGAG
CTGACGCTGC CCCGGGTCGT CCAGCTCATC GGGAAGACCA ACCAGTTCAA CCTGACCACC
CGGCGCCGCG GGGAGGCCGA GGTCGCCGAG CTGGCCGCGG ACTCGTCCAC CGCGGTGATC
TGCGTTCGGC TGGCCGACCG GTTCGCCGAC CACGGCCTCG TCGCGGTCGT CATCGCCCGG
CGTGCCGCCG AGCCCGGCGG GGCCGTCCTC GACGTCGACA CCTGGCTGAT GAGCTGCCGG
GTCATCGGCC GGACGCTGGA GGACGAGATC GCCGGGCTGA TCGTGGCGGA GGCCCGGCGG
CTGGGTTGTT CCTCCGTGCG GGGGCACTAC CTGCCGACCG CGAAGAACGC GCTGGTGGCC
GACCTGTACC CGCGGCTGGG CTTCCGGCCG GACCCCGCGG CTCCCGCCGA CGCCGGCCGC
GTCGCCCCGG CCGGTGGCCC CGGTGTGGCT CCCGCCTATG GCACCGGCGC GGGGGCCACC
CGCTGGGTGC TGCCGGTCGC GAGCGCCCTG AACCGGCCCG GGCTCATCCG GGTCGTCGAC
GCTCGCGCGG GCGGGGCCGC TGTCGCGAGC GCATCCGTGC GGACGATGGA GGAGGTGGCC
GGCTGA
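
The record reports a GC content of 75% for this gene. A minimal sketch for recomputing that figure from a sequence displayed in space-separated blocks of ten bases, as it is here, might look like the following. The helper names `clean_sequence` and `gc_content` are hypothetical, and the short inputs are toy strings rather than the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy inputs for illustration only.
print(gc_content("ATGC"))        # 50.0
print(gc_content("GGCC\nGGCC"))  # 100.0
```

Pasting the full gene sequence into `gc_content` as a multi-line string should give a value that can be compared against the reported 75%.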
 
Protein sequence
MADPAGTGRH RLAELTAAAT AAEVPPAGLC RDLALAYEKA GDPAAAVRWA LATTDAGGDL 
TCWTVAAGVL RRVSAGAVAS AVSPGAAASV GAAVPARAAV AAAAGGAVMP SPVAGHTARS
ARVAVLGSYT TAQLAALLPL AAARAGVAVE VYECGYGQYR QEVLDPGSGL YRFGPDVVIL
AVHEGEAALP ALAEDPERAV AAEVGRWTSL WEIIISRLDA RVIQHTFVVP ETEALGHLAL
RIPGSRSSML AAVNAALGRA AAGGQVAFVD CERLAATVGK RSWFDPRYWH RAKQAVSLAH
VPALARHTAA VLGAQLGTSR KCLVLDLDNT LWGGVLGEEG LAGIALGDGP VGEAFSAFQE
YIGRLRARGV ILAVCSKNNE ADAREAFERH PAMRLRLDDI AMFSASWEDK PTQIRRIAST
LGIGLDSLVF VDDNPAEREV VRQLVPEVDV IALPRDPHGY VRAVADYPFF EPAALTTEDT
ARTEQYRARA RAAELAASAT SLEEFHRGLE MVATVVPLDE LTLPRVVQLI GKTNQFNLTT
RRRGEAEVAE LAADSSTAVI CVRLADRFAD HGLVAVVIAR RAAEPGGAVL DVDTWLMSCR
VIGRTLEDEI AGLIVAEARR LGCSSVRGHY LPTAKNALVA DLYPRLGFRP DPAAPADAGR
VAPAGGPGVA PAYGTGAGAT RWVLPVASAL NRPGLIRVVD ARAGGAAVAS ASVRTMEEVA
G
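
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below builds a codon table from the standard amino acid assignments, which table 11 shares for sense and stop codons (it differs from the standard table only in its permitted start codons), and translates the first 30 bases of the gene. The `translate` helper is illustrative and does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Codons enumerated in T, C, A, G order; amino acids in the matching order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
print(translate("ATGGCTGATC CCGCGGGGAC GGGCCGGCAC"))  # MADPAGTGRH
```

The output matches the first ten residues of the protein sequence, and the final six bases of the gene (`GGCTGA`) translate to a single glycine followed by a stop codon, in line with the terminal `G` of the protein.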