Gene Franean1_4651 details

Gene Information

Locus tag: Franean1_4651
Symbol:
ID: 5672994
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5550112
End bp: 5552019
Gene Length: 1908 bp
Protein Length: 635 aa
Translation table: 11
GC content: 69%
IMG OID: 641243509
Product: FkbH like protein
Protein accession: YP_001508925
Protein GI: 158316417
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3882] Predicted enzyme involved in methoxymalonyl-ACP biosynthesis
TIGRFAM ID: [TIGR01681] HAD-superfamily phosphatase, subfamily IIIC
            [TIGR01686] FkbH-like domain
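
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 5550112
END_BP = 5552019


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
print(gene_len)                     # 1908
print(protein_length_aa(gene_len))  # 635
```

Both results agree with the Gene Length and Protein Length values listed in the record.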


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCCCCG GGCCCGCCGC GCTCCCACCG CTTGACAGGT TGCGTTCGCT GCACGCCGCG 
GGCGGCCTGA CCGGCGAGTA CGGGGTGGTG GCGGGCCTGC TGGCGGAGAT CGCCACCCAG
GAGAACGGCG CGACCGATCT CGTCCGGGCC GGACAGATAC TGACCCGTCT CGACCCGGAC
GAGGTCCTGG AAGCAGGCGG CGCTGTCGAG GTGGTCGCGG TCGCGATCAC CGGACATTCG
ACGATTGGCA ATCTGCTTGC CCCGCTGACC GCCGAATTCG CCCGGCACGG CGTGCTACTC
CGCCCGCGGA TGAGCGATTT CGACGCCTAT CTGCGGGACC TGCGGGAGAC CGACAGCCCC
CTGTACGCGC CCGACGTGAC GCTGGCGTTG TGCGTCCTGG ACGCACAGGT GATCTTCGAC
GAGCTCCCCG CGCCGTGGCT GGTGGAGCAT CTGGAGGCGG CCGCGGCGGG GAAGCTCGCG
CAGCTGGAAC GGCTGGCCGA CCAGTACGTG GAGCACGGTC GGGGAACGCT GGTGCTCAAC
ACGGTCCCAC TGCAGCGCCT GCACACCCAC CAGCTCGTCG ACCATCGTTC GCGGGCCGTC
CTCGGGGTGG CGTGGCGGGA GTTCAACGCC GGCCTGCTCC GGCTGGCCGA GAAACATCGG
CGGGTCGTCG TCATTGATCT CGATCCGCTG ATCGCCGCCG GCGGGCCGGT CAACGAGACC
CGGATGGCCT CGTACGCCAA GGCCCATCTC GGCGAGGAGC TGCTGGCCCG GTACGCGCGC
GAGGTCGCAC ACCTGGGGCG GGCCGTGCGC GGGAAGGCCA AGAAGGTTCT GGTCGTCGAC
GCGGACAACA CCCTTTGGGA CGGCATTCTC GGTGATGACG GGCCGGACGG TATCGCCGCC
GCCACCACCT TCCGTGGCGA GGCCTTCGGG AACTTCCAGC GGGTGGTCGC CCAAATCGGA
TCGCAGGGGG TCTTGCTCGC CGTCAGCAGC AAGAACGACC AGGATCTCCT GCTGCAGGTG
CTTCGTGAGC ACCCGGACAT GCAGCTGCGG GAGAGCGACT TCGTCCGGGT GAACGCGAAC
TGGAATCCCA AGGACGGGAA CCTGCGGGAC ATCGCGAAGG CGCTCAACCT CGGCGTCGAC
AGCTTCGTCT TCGTCGACGA TTCCCCGTTC GAGTGCGGCC TGATCGCCTC CAGCCTGCCG
GGTGTGGCGG TCGTACGTCT CGATGAGGAA CCCGCCCTGC ACATCGAGCG GCTGCTGGCC
GATGGTTGGT TCGACACGCT CGAGCTCACT ACGGAGGACC GCGCCCGGGC CGGCCAGTAC
CGTGCCGACG CCGGCCGCCA GAGCCTGCTG GACAGCGCGT CCTCGACCGA GGAGTACCTG
GCCCAGCTCG GCGTGGCGGT CACCGTCACC CGGATCCGGG AGCACGAGGT GGCCCGGATC
GCCCAGCTCA CCATGCGTAC CAACCAGTTC AATCTCACCA CCCGGCGGTT GCAGCCGGCC
GATGTCGCGG CCCGGACCGA CAGCTCCGAG CACCTCGTCC TCTCGGTCCG CTCCGCCGAC
CGTTTCGGCG ACAACGGGGT CGTCGGCGCG CTGTTCGCCC GGATCGGTCC CGACGGTCTG
CACATCGAGA ACATGCTGCT GAGCTGCCGA GTCTTCGCCC GTGGCATCGA GCAGGTCACG
ATCACCGCCG TTCTGGAGCA CGCTCGGTCC CTAGGCCTGC CGGCCGCCTT CGCGAGCTAC
CGGCCGACCG CGAAGAACAA GAAAGTCCGC GACTTCTATC CATCCATGGG ATTTGCCGTG
CTGGTGGCGG GCGACGAGGA AACGAGCTTC CGGCACGACC TCGCCGACAT CCCCGCCGTC
CCGGAACACC TCCAGCTCAC CGTCGACCTC GAAAGGCCAC AGTCGTGA
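
The gene sequence above is printed in space-separated blocks of 10 bases, so whitespace has to be stripped before any computation. As a minimal sketch, the helper below (a hypothetical name, not a database function) computes the G+C fraction of such a block-formatted string. It assumes that the record's GC content figure is the G+C fraction of the gene sequence itself; the example uses only the first two blocks of the sequence.

```python
def gc_fraction(seq_text: str) -> float:
    """G+C fraction of a nucleotide string; whitespace is ignored."""
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence above.
fragment = "GTGGCCCCCG GGCCCGCCGC"
print(gc_fraction(fragment))  # 0.95
```

Passing the complete gene sequence to the same function gives the value for the whole gene.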
 
Protein sequence
MAPGPAALPP LDRLRSLHAA GGLTGEYGVV AGLLAEIATQ ENGATDLVRA GQILTRLDPD 
EVLEAGGAVE VVAVAITGHS TIGNLLAPLT AEFARHGVLL RPRMSDFDAY LRDLRETDSP
LYAPDVTLAL CVLDAQVIFD ELPAPWLVEH LEAAAAGKLA QLERLADQYV EHGRGTLVLN
TVPLQRLHTH QLVDHRSRAV LGVAWREFNA GLLRLAEKHR RVVVIDLDPL IAAGGPVNET
RMASYAKAHL GEELLARYAR EVAHLGRAVR GKAKKVLVVD ADNTLWDGIL GDDGPDGIAA
ATTFRGEAFG NFQRVVAQIG SQGVLLAVSS KNDQDLLLQV LREHPDMQLR ESDFVRVNAN
WNPKDGNLRD IAKALNLGVD SFVFVDDSPF ECGLIASSLP GVAVVRLDEE PALHIERLLA
DGWFDTLELT TEDRARAGQY RADAGRQSLL DSASSTEEYL AQLGVAVTVT RIREHEVARI
AQLTMRTNQF NLTTRRLQPA DVAARTDSSE HLVLSVRSAD RFGDNGVVGA LFARIGPDGL
HIENMLLSCR VFARGIEQVT ITAVLEHARS LGLPAAFASY RPTAKNKKVR DFYPSMGFAV
LVAGDEETSF RHDLADIPAV PEHLQLTVDL ERPQS
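
The protein sequence follows from the gene sequence by translation with table 11, as listed in the record. The sketch below builds the codon table from the NCBI amino-acid string for that table (codons ordered with bases T, C, A, G) and assumes that the first codon is an initiator, so that an alternative start codon such as GTG is read as methionine. The set of start codons and the function name are stated here as assumptions for illustration.

```python
from itertools import product

# Amino acids for codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}
# Initiator codons assumed for table 11; each is read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq_text: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at a stop codon."""
    seq = "".join(seq_text.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence in this record.
first_line = ("GTGGCCCCCG GGCCCGCCGC GCTCCCACCG "
              "CTTGACAGGT TGCGTTCGCT GCACGCCGCG")
print(translate_cds(first_line))  # MAPGPAALPPLDRLRSLHAA
```

The output matches the first 20 residues of the protein sequence above, including the leading M encoded by GTG.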