Gene Franean1_2019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2019 
Symbol 
ID5670420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2425919 
End bp2427802 
Gene Length1884 bp 
Protein Length627 aa 
Translation table11 
GC content77% 
IMG OID641240940 
Productketopantoate reductase ApbA/PanE 
Protein accessionYP_001506362 
Protein GI158313854 
COG category[H] Coenzyme transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG1893] Ketopantoate reductase
[COG2897] Rhodanese-related sulfurtransferase 
TIGRFAM ID[TIGR00745] 2-dehydropantoate 2-reductase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.691708 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.348803 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCA GATACGTGAT CATCGGAGCG GGTGCCGTCG GGGGGACCGT CGCCGCCCAG 
CTGCACGATG CCGGTGTCGA CGTCGTCCTC GTCGCCCGCG GGGCGAACCT GGCGGCGCTG
CGCGCCGACG GCCTGCGCTA CCTCCGTCCC GACGCCGACC GTCCCGACTG GGACCGGCGC
CTCCCCCTGC CGGTGGCCGG CGGCCCCGAC GAGGTCGACC TGCGCCCCGG CGACGTGCTG
GTGCTCTCCG CCAAGTCCCA GGACACCGAG GCGCTCGTCG CCCAGTGGGC GTGGCGGCCG
GTGCACCCGG GCGGCACCGC CGCCGAACAG CTGCCGATCC TGCTGCTGCA GAACGGGGTG
GAGAACGCCC GGGCCGCGCT GCGCCGGTTC GACACCGTCA TCGACGCGGT CGTGATCATC
CCGTCATCCC ATTCGACGCC CGGAACGGTG GTCTCGCCGG GCGCCCCGCT GGCGGGCGCC
TTCTACCTGG GACATGCCCC GCGCGGCGAG AGCGAGGCGG CCGAGCGGAT CGCCGCCGAC
CTGCGGCGCG CGCAGTTCGC GGTGAAGGTC GTTCCGGACG TCGAACGGTG GAAGGTCGGC
AAGCTGCTCG GGAACCTCGC CTACAACCTC GACGCCGTCT ACCCGCCCAG CGCGGCCCGC
GACCGGCTCG GCGCGGCGCT GGTCACCGAG GCCCGCACGG TGCTGGCCGC CGCCGGGGTC
GAGGTGACCG ACGTGCTCAC CGGCGACACC GGTCTCGACC TGTCCGGCCT GGTCCTGCAC
GACATCCCCG GCCACTCCCG CCAGGGCAGC TCGACGTGGC AGAGCCTGGC CCGCGGCGCC
ACGGTCGAGT CCGACTTCCT CAACGGCGAG ATCGCGCTCC TCGCCCGCCT GCACGGGACG
TCCGCGCCGC TGAACGCGGG CGTTCAGCGG CGCATCGCCC TCGCCGCTCT GGGCGGGTCG
AGCCCCGGCG GCCTCGGCGA GGACGACCTC ACCGCCCTGC TCGCGTCCGC GTCACCCGCC
GACGGCGACA GTCTCCGTGA GGTGCTCGTC GACGCCAAGC GGCTGCACGA CCTGGTGGCG
ACGGGGCCGG CGCCGGTGCT GCTCGACGTC CGCTGGGCGC TCGGCGACCC GCACGGGCGG
GAGCACTACC TGGCCGGTCA CCTCCCGGGC GCCGTCTACG TCGACCTGGA CACCGAGCTG
GCCGCCGCGC CGGGCGGGAC CGCGGGGCGC CACCCGCTGC CCGCCGTCGA GGACCTGCAG
CGGGCGGCCC GGCGCTGGGG TGTGTCGGCC GGCCGGCCCG TCGTCGTGTA CGACGACAAC
GGCGGGCTGG CCGCCGCGCG GGCCTGGTGG CTGCTGCGCT GGGCAGGCCT CTCGGACGTC
CGCATCCTCG ACGGCGCGCT CGGTGCCTGG CGCGAGGCCG GGTTCCCGCT CGCCACCGGC
GACGTCGTCC CCGCCCCGGG CGACGTGGTG CTCAGTGCCG GTCACCTGCC CACCCTCGAC
GCGGACGGCG CGGCCCGAAC CGCCCGCGAC GGCGTGCTGC TGGACGCGCG CGCCGCGGAG
CGCTTCCGCG GCGAGGTGGA GCCGGTCGAC CCGCGCGCGG GCCACATCCC CGGGGCGGTC
AGCGCGCCCA CCGGCGACAA CCTGGACGAA CATGGCCGTT TCCTCACCCC GGACCGGCTG
CGGGAGCGCT TCGCCGCCCT CGGGACGTCC GCCGGCGGCC AGCCGGTCGG TGGCCAGCCC
GTCGGCGTGT ACTGCGGCTC CGGGGTGACC GCCGCGCACG AGATCGCGGC CCTGGCCACC
GCCGGCATCG AGGCCGCGCT CTACCCGGGC TCGTGGTCGG CCTGGTCCGC CGACCCGCAG
CGCCCCGCCG CCACCGGGTC ATGA
 
Protein sequence
MTARYVIIGA GAVGGTVAAQ LHDAGVDVVL VARGANLAAL RADGLRYLRP DADRPDWDRR 
LPLPVAGGPD EVDLRPGDVL VLSAKSQDTE ALVAQWAWRP VHPGGTAAEQ LPILLLQNGV
ENARAALRRF DTVIDAVVII PSSHSTPGTV VSPGAPLAGA FYLGHAPRGE SEAAERIAAD
LRRAQFAVKV VPDVERWKVG KLLGNLAYNL DAVYPPSAAR DRLGAALVTE ARTVLAAAGV
EVTDVLTGDT GLDLSGLVLH DIPGHSRQGS STWQSLARGA TVESDFLNGE IALLARLHGT
SAPLNAGVQR RIALAALGGS SPGGLGEDDL TALLASASPA DGDSLREVLV DAKRLHDLVA
TGPAPVLLDV RWALGDPHGR EHYLAGHLPG AVYVDLDTEL AAAPGGTAGR HPLPAVEDLQ
RAARRWGVSA GRPVVVYDDN GGLAAARAWW LLRWAGLSDV RILDGALGAW REAGFPLATG
DVVPAPGDVV LSAGHLPTLD ADGAARTARD GVLLDARAAE RFRGEVEPVD PRAGHIPGAV
SAPTGDNLDE HGRFLTPDRL RERFAALGTS AGGQPVGGQP VGVYCGSGVT AAHEIAALAT
AGIEAALYPG SWSAWSADPQ RPAATGS