Gene Franean1_4940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4940 
Symbol 
ID5673279 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5930417 
End bp5931808 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content74% 
IMG OID641243794 
Productputative secreted protein 
Protein accessionYP_001509210 
Protein GI158316702 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGG TGGGAACGCC GTCCGCAGCA CCTGCCGCCG GTAACCCGGC TCCCGTGTCC 
GGCTCGGCGA CTGCTGTCTC CACGGTTTCG GGCGCCGCGG CCTCCGTCGC CGCGGCGGCG
GGGACGATCC GGCGACGGAG CCGGACTCCA CCCGGATTCC TGCGCCTGCT GTCCGCCGGG
CTCGTAGGTG TCCTGATGGT GACGTTGTCC GTCTGCCTGC TGTCCACCCT CTCCCGCCAG
CACGCCGTGG ACGCCCTCGC CCGCGACTCC GGTGCGTCGT TCGTGGCGGC GCAACAGCTG
CACGCCGAGC TCTCGGTGGC CGACGCGACC GTGGCCCGCG CGTTTCTGGC CGGCGGCGTG
GAGCCGCCGG CCCAGCGGAC GGCCTACCAG GAGAGCATCG CCTCGGCGAG CGGGCGCATC
GTCGACCTGG CGCTCGCCGG CGGGCCGCGC GAGCCGCTGA GCGTCCTGGC GGCCCAGCTG
CCGGTGTACA CCGGCCTGAT CGAACGGGCG CGCGCGAACA ACCGGATCGG GAATGTCGTC
GGCGGCGCGT ACCTTCGTCA GGCCTCCGAA CTCATGCAGA CCAGGATCCT CCCAGCGGTC
GACCGGCTGG CCGCCGAGGA CGCGCTGGAC ATCGATCGCG GGTACGCCCA GGCGACGCGC
TGGTACCAGC CGGTCCTCGT CGGCGTGGCC GGCGCGGCGG CGCTGGCGGC CCTGGTCGCC
CTGCAGATCC GCCTGTTCCG GCGGACGCAC CGGATGTTCA ATCTGCGCCT GGTCGCCGCG
ACGGTACTGG TCGTGATCGC GACGGGTCTC ACCCTGCTGG CTTTCGGTGT CTCCCGGGCC
CGCCTGGTCG ACAGCAGGAA CGACGCCTTC CGGCCGATGA CGGTCGTCGC CCAGGTGCGG
GTGCTGGCGC TACGGGCCTG GGGCGACGAG AGCCTCTCCC TGATCGCCCG CGGCAACGGC
GACGACCTCG ACGCCGACGC GCGCCGGGTG ACCGAGCGCC TCGGCTACGA CCCGGCGGGC
CGACCCGCCG GCGCGGGAGG GCTCGCCACC GCCGCGGCGC TGGACGGCCC GGACGCGCCG
GGACGGGACG TGCTCGTACC CGACTGGGAG CGCTACCAGG ACACCGCCGT CCGGGTCCGG
GACCTGGTCC GCGACGTCGG CGGCTTCCAG GAGGCCGTAC GGGTGGCCCT CGACGAGGGA
ACCTCGACGT TCACCCGCTT CGACGGCGAC GCCGAGACGG CGTTCACCGC GAGCCGCGAG
CGTTTCGCCG CCGGGCTGAG CTCCGCCGCG GGCACCTACG ACGGTGTCGC CGCCGGTACC
GGCACGGCGC TCGGGCTGGC GATGCTCCTC ACGCTGGCCG GGGTGCAGTC GAGGATCAAT
GACTACCGTT GA
 
Protein sequence
MSTVGTPSAA PAAGNPAPVS GSATAVSTVS GAAASVAAAA GTIRRRSRTP PGFLRLLSAG 
LVGVLMVTLS VCLLSTLSRQ HAVDALARDS GASFVAAQQL HAELSVADAT VARAFLAGGV
EPPAQRTAYQ ESIASASGRI VDLALAGGPR EPLSVLAAQL PVYTGLIERA RANNRIGNVV
GGAYLRQASE LMQTRILPAV DRLAAEDALD IDRGYAQATR WYQPVLVGVA GAAALAALVA
LQIRLFRRTH RMFNLRLVAA TVLVVIATGL TLLAFGVSRA RLVDSRNDAF RPMTVVAQVR
VLALRAWGDE SLSLIARGNG DDLDADARRV TERLGYDPAG RPAGAGGLAT AAALDGPDAP
GRDVLVPDWE RYQDTAVRVR DLVRDVGGFQ EAVRVALDEG TSTFTRFDGD AETAFTASRE
RFAAGLSSAA GTYDGVAAGT GTALGLAMLL TLAGVQSRIN DYR