Gene Franean1_5085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5085 
Symbol 
ID5673420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6085177 
End bp6086601 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content78% 
IMG OID641243936 
Producthypothetical protein 
Protein accessionYP_001509350 
Protein GI158316842 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0309888 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000681327 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCCGGCGG ACCCCGGCCC GCCTTCCGCG CAGTCCTCGA GCGAAGGCCC CGAGGCGGTC 
CGCGTGCCCG AGGAACCCGA GCCCGAGAAA CCCACGCCCG TTGAACCCGC GGACGACGCC
GCGGCGGTGA CCGCCTCCGG CCTGCCGCCG GCCAGCGACC CCTCTGCCGG CGCCCCGCCT
TCCGGCCACC AGCCTTCCGG CGACCCGCCC GCCGAGCACA CGCCCGCGGG CGACCCGCCC
GCTGCCGACG GTCCCTCGGG GGGCATTCCC TCGGTCGACG GTCCCTCGGG AGAGGACCCC
CTGCGAGAGG GCATCCTGGA CGAGGACCCC CTGGGCGGCT CTCCTCCGGA CGACGGTGCT
GCCGCTGATC CCTCAGCCGC CGATCCCGCG GATCCCACAG CATCCACCGG GTCCGCGGAT
TCCGCGGATT CCGCCGGGCC CACGACGTTC GTCCGGCCTA CATCGGCCGC CGGGCCCGCG
CCGTCCGCGG GGCCCGCTGA GCCCGCGGGT GACGTGCCCG GCGACGGCTC CCCGCCGGGC
GGGTTGGGAC GCTCCGTGAT CCGACTGCTC GACCGGGTCG ACGCCGCCGC CCTGCCGCCG
GTCGGCCGGG TGCTGGAGTC GGTCGCCGGC GGCCTCGCCG GCGCACGCGA CCGCCCTCGG
GCGCGCCTGC GCCGCGCGTG GGCGGCACGC GTGGGCCCGT ACGCCGACGG CGACGGCCCC
CCGCCACCGG GCCGGGCCAG CGTCATCACC GGGCGCGCCC TCGAGGCCAT CGGCCGGCTG
CTCGTGCTCG GGCTGGTCGT GCTGATCGTC GTCGGTGCCG TGACCACGAT GCTGCGCGGC
GCCGATCCCT CGGGCAGCCA TGCACCCGGC CCCGGTCCCG GCAACGCTCC GGCCGGGCCC
GTCGAGCCCA CGGTCACCGT CGGCCCGGCC GCGGGTGAGT CGCCCGCCGC GTACGCCGCC
GAGGCCGAGG CGAAGCTGGA CGGCCTGACC CGGGCGGCCC CGGACGCCGA CCTCTACGCC
GTCGTCAGCC TGGCCGGCTA CCGCACGCCG GACGAGATGC TGCAGATCTT CACCGCCTAC
CGCACGCTCG AGGTGTTCTT CGGCGTCCCG CCGGACGGGG CGGTGATGGC CGCCACCGTC
CGCGACCCGG TCGCGGACCT CACCGCCGCC TTCGACAGCG AGGCCGACGC CGCCGACACC
CGCGCGCACA GTGCCACGGA CCCCGCCGAG GCCGAGCACG CGCGGCAGGA GGCCACGGCG
CTGCGCGCGC GGTGCGGCTG CCTGTTCGGC GCGGTCGTGC GGGCGCCGGC GGCCCGGCTG
GTCGACCTCG GACACATCGA GGGGGTGCGG GTCGTCGACC CGGCCCCGCC CGGCATCTCC
CCGGAGACCG TGCGCTTCCT CCCGCTGCAG CCCGACCAGC GCTGA
 
Protein sequence
MPADPGPPSA QSSSEGPEAV RVPEEPEPEK PTPVEPADDA AAVTASGLPP ASDPSAGAPP 
SGHQPSGDPP AEHTPAGDPP AADGPSGGIP SVDGPSGEDP LREGILDEDP LGGSPPDDGA
AADPSAADPA DPTASTGSAD SADSAGPTTF VRPTSAAGPA PSAGPAEPAG DVPGDGSPPG
GLGRSVIRLL DRVDAAALPP VGRVLESVAG GLAGARDRPR ARLRRAWAAR VGPYADGDGP
PPPGRASVIT GRALEAIGRL LVLGLVVLIV VGAVTTMLRG ADPSGSHAPG PGPGNAPAGP
VEPTVTVGPA AGESPAAYAA EAEAKLDGLT RAAPDADLYA VVSLAGYRTP DEMLQIFTAY
RTLEVFFGVP PDGAVMAATV RDPVADLTAA FDSEADAADT RAHSATDPAE AEHARQEATA
LRARCGCLFG AVVRAPAARL VDLGHIEGVR VVDPAPPGIS PETVRFLPLQ PDQR