Gene Franean1_1271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1271 
Symbol 
ID5669684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1528864 
End bp1530831 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content73% 
IMG OID641240203 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_001505631 
Protein GI158313123 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.424952 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGAGAC GTAGAGTAGA CGGACGACGT CACGGTCTGT GGACAACAGC GGTTGCGCGG 
CGCGTGTCGC GACTCAACCT CCAGCAACGG GTCGGTCTGC AGATCGTGCT GGACGTCTGC
GCGCTCGCGC TCGGGTTCAT CGCCGCTCAG GTCGGCCGAC TCGACCTGGA CCCGGCCGCG
CTCACCGATC CGGGCTTCTG GGTCATCGTC TTCCTCGCCG TGTGCCTGCT CCACTTCCTG
GGCACGGCGC TGCACCTCTA CCTGGGCCGG TACCGGTTCG GCGGGTTCGA GGAGGTGTTC
GGCATCCTGG TCGCGGTGGC TCTCACCGTC CTCGGGGTGC TCGTGGTGGT GCTGGCGGTC
GGCGTGCCGC GGCCGGTACC GCTCAGCGTG CCGCCGCTGG GCGGGGCGGT CACCCTGGTG
CTGATGCTCG GCATCCGCTA CCTGTGGCGG CTGGCCGAGG AGCGGCTGCG CCGGCCGGCC
CCCGACGCCA CCGAGCCGCT GATCGTCTTC GGCGCGGGCG ACGGCGGGCA GCGGGTGCTC
ACCGCGATGC TGCGCACGCC GAGCAGCCCC TACTACCCGG TCGCGCTGCT CGACGACGAC
CCCCGGACCT GGAACCTGCA GCTGTCCGGG GTGCGGGTCC GCGGCGGCCG GGACGCGATC
GCCGGCGTCG CCGCCTCCAC CGGCGCACGC ACCCTGCTCG TCGCGATCCC GAGCGCGGAC
GCGGCGCTGC TGCGGGAGAT CAGCGCGCTG GCCGAACCGG CCGGTCTCGC CGTCAAGGTG
CTCCCCCGCG TCGCAGACCT GGTGGACGGC ACCGTCGGCG TAGCGGACAT TCGCGATCTC
GACCTCGCCG ACCTGCTCGG CCGCCGGCAG ATCCAGACCG ACATGACGGC CGCCGAGCGC
TACCTCACCG GCCGCCGGGT CCTCGTGACC GGCGCCGGCG GGTCGATCGG ATCGGAGCTG
TGCCGGCAGA TCCACGCCTT CGGGCCGGCC GAACTGATCA TGCTCGACCG GGACGAGTCG
GCGCTGCGCG CCGTCCAGCT CTCGCTGCAC GGCCGGGCGA TGCTCGACGA CGACACGATC
GTCCTCGGCG ACATCCGCGA CACCGAGCTC ATGGCCGCGC TGTTCGCCGC CCGCCGGCCC
GAGGTCGTCT TCCACGCCGC GGCGCTCAAG CACCTCCCGC TGCTGGAGCG CTTCCCGGGC
GAGTCGGTGA AGACGAACCT GTGGGGGACG CTGACCGTCC TGGAGGCCGC GGCCGCCTGC
GGGGTGCGGC GCCTGGTGAA CATCTCGACC GACAAGGCCG CCAACCCGAG CAGCGTGCTC
GGCCACTCCA AGCGGATCAC CGAGCGCCTC ACCGCGCACG TCGCGGGCCA GGCGCCGGGG
GTGCTGGTCA GCGTCCGCTT CGGCAACGTG CTCGGCAGCA ACGGCTCGGT GCTGACCGTC
TTCGCCGGCC AGCTCGCCGC GGGCGGGCCA CTGACGGTCA CCCACCCGGA GGTGACCCGC
TACTTCATGA CCATCCAGGA GGCCGTCCAG CTCGTCCTGC AGGCCGGGGC GCTGGGCTCC
GCCGGCGAGG CGCTGGTCCT CGACATGGGG GAACCGGTGC GCATCGCGGA CGTCGCCCGT
CGCATCGCGG CCCGCGCGCC CGCGCCGGTG GACATCGTCT ACACCGGGCT CGGGGCCGGC
GAGAAGCTGC ACGAGGAACT GCTGGGCGCC GGCGAGTGGG ACTCCCGGCC GCGGCACCCG
CTGATCTCAC AGGTACCGGT ACCGCCGCTG GACCCGGCCG CCGTCCGGGA CATCGACCCG
TACGCGGCAC CGGATCTGAT CCGGGCCACG CTGACCCGGC TGGCCGCCGA ACAGCCCATG
CCGAACGTGC CGCGCCAGAC CAATCCAGGT CAGGACGAGC CGCGCCAGAC CGGGCCGCGT
CAGGACGGGC CGCGTCAGGA CGGACCGACC GAGGCGCGGA CCGGGTGA
 
Protein sequence
MWRRRVDGRR HGLWTTAVAR RVSRLNLQQR VGLQIVLDVC ALALGFIAAQ VGRLDLDPAA 
LTDPGFWVIV FLAVCLLHFL GTALHLYLGR YRFGGFEEVF GILVAVALTV LGVLVVVLAV
GVPRPVPLSV PPLGGAVTLV LMLGIRYLWR LAEERLRRPA PDATEPLIVF GAGDGGQRVL
TAMLRTPSSP YYPVALLDDD PRTWNLQLSG VRVRGGRDAI AGVAASTGAR TLLVAIPSAD
AALLREISAL AEPAGLAVKV LPRVADLVDG TVGVADIRDL DLADLLGRRQ IQTDMTAAER
YLTGRRVLVT GAGGSIGSEL CRQIHAFGPA ELIMLDRDES ALRAVQLSLH GRAMLDDDTI
VLGDIRDTEL MAALFAARRP EVVFHAAALK HLPLLERFPG ESVKTNLWGT LTVLEAAAAC
GVRRLVNIST DKAANPSSVL GHSKRITERL TAHVAGQAPG VLVSVRFGNV LGSNGSVLTV
FAGQLAAGGP LTVTHPEVTR YFMTIQEAVQ LVLQAGALGS AGEALVLDMG EPVRIADVAR
RIAARAPAPV DIVYTGLGAG EKLHEELLGA GEWDSRPRHP LISQVPVPPL DPAAVRDIDP
YAAPDLIRAT LTRLAAEQPM PNVPRQTNPG QDEPRQTGPR QDGPRQDGPT EARTG