Gene Franean1_5078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5078 
Symbol 
ID5673413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6077590 
End bp6080019 
Gene Length2430 bp 
Protein Length809 aa 
Translation table11 
GC content73% 
IMG OID641243929 
Producthypothetical protein 
Protein accessionYP_001509343 
Protein GI158316835 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.126059 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0204601 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCTTCGG CGGCACACAG AGCTGACAAC GGCGCGAGCA AGCGTGCTCG TGCCGCGGGT 
AGTCGCTGCC ATCACCAGGA ATCCGACAAC AACGTCGACG CAACGCGATA CAACTCCGCA
GAGCATTCTC TACGAAAGGA CACCCGATTA GCCATGGCGG GGGACGTACC GGCCGACGAG
CGGATCCTCA TCACGCTGGA TGACCCCGAC CCCGAGGCCG ACGCGCGGCG CGGCCTGGTC
GGTCTGTCGC GAAACGCCGC CGGTGGCGCT GAGGGTGAGC TCCCGGATGG TTCCGCTGGC
GACGTCGGTG GCTCCGGCGG AACCGGTGGC GCCGGCCCGT CCGAGATCTC CGGCCCGAGC
GGCCCGGGTG ACCCCGGTGC CCAGGACGTG ACCGAGGACC CCCGCCACCG TGAGCGCGCC
ATCCACCGCT ACGGCCGCGT CCGGGTGGTG GGCCGGCCGC TGGCGACCGC GGCCGTGGCC
ACCGCTCGCG GGGTCTCCGC CGGCACCACC CCGGTGGACG GGACCGTCGT GTCGACGCTG
AGCCTGCCCG GCGGGCTGGA CCGCACCGAG GCGCTCGGGG TCGAGGCGTT CCGCCTGCGC
GCGGGGGACG ACTACCGCCG GCGCAAGCGC TCGCGGCCGC GCGACGCCGC GCCCTGGGAC
ATGAACCGGC CGTGCACCGA CATCGCCCCG CCGCGCGGCT CTGCGGGCCC AGGCGCCACC
CCGCTCGCAC CCACCGTGCG CTCGCTGCTC AACGGGCTCG GATCGGACGC CTCGGGACCG
GAGGCGACCG GGTCGCCCTC GGGCGCGGTC GGGACGGGCA CGGCCGGAGC GCCGTCCGCC
GGAGCGCTCA GCAGTTACCT GGAGGGCTCC GTCGCGGTCG GGCTCATCCT CGTCGAGGGG
CCGACGCCGG CCCTGCAGCT CTCCGCCGCG GAACGCACGA AGATCGTGGC CGAGGTGCAG
AACGGCCTGT CCTGGTACGC GACGCAGAAC CCGGCGGCCG AGCTGACGTT CAGCTACGAC
ATCCAGATCG TCCGGCTGCC CACCCCCGCC AACCCGTCCG CGAACGACCT CGAGGCCCTC
TGGCGCGACC CCACGATGAG CCGGCTCGGC TACGCGGCGA ACTTCGACGG TGTCTACGAC
TATGTCGAGG CGCTGCGCGC CCGGCTGCGT ACCCGGTGGG CCTACTGCGC GTTCGTCACG
AAGTACCCGC TCGGCCACTT CGCCTACGCG TCGGTCGGCG GCCCGCGGAT GGTGCTCGCC
GCCGACGCCG ACGGCTGGGG CCCGGACAAC ATCGACCGCG TGTTCGCGCA CGAGACCGGC
CACATCTTCG GGGCGCCGGA CGAGTACGGC GGCGCCGGCT GCGACTGCGG CGGGAGCTGG
GGTCGGTACG GGGTACCCAA CGGCAACTGC GACTCCTGCG CGCCGGCGCC GGTCGACTGC
CTGATGCGCG CCAACACGTT CGCCCTGTGC CGGTACACGC CCGCGCACAT CGGCTGGGGC
CACGGCGTGA GCGGCAACCC GGTGCTCCTC CAGGCGAAGG GCCTGGGCGT GCGGGGCAAC
TTCGACGTCG TCGCGCCGTC GGCCTACGCC GGCCTCACCC ACGTCTGGCG GGACAACGAC
GCCGCCGGCG CACCGTGGCG GGACCCATGG CAGACCGCGC AGGCCCTCGG CCGCGTCGAC
GCGGTGACGA TGGTGCAGAG CACCCTGGCG AACCCGGGCC CGCTGGAGGT GGCCGTCCGC
GTGGGCTCGC GGCTGTTCTT CCTGTGGCGG GACAGCACCG GGGCGTTCCA GTGGCGCGCG
CCCGTCCAGC TCGCCCAGGG CGTCGGCGGC GTCCCGTCGC TGGTGCAGAG CAGGCTGGGC
GGCAAGGGCA ACTTCGAGCT GCTGGCCCCG GCCGCTGATG TCGGCATCAT GCACATGTGG
CGCAACCACG ACGTCTACGG CTACCCGTGG AGCGCGCCGA AGCTGTTCGC GGCCAACCTG
GGACGGGTCG ACGCGGTCAG CCTGATCCAC GGGACGCTGG GCGGCGGGGC CGGGATGCTG
GAGGCCGTCG CCCTGGTCGG CACCCGACTG GTGCACCTGA CGCGCGACCA GGCAGCCGTC
TGGCGCACCG GCGGGATCTT CGCCGAGGGG GCGTCCGGCA ACCCGGCGCT GATTCAGAGC
GTGTTCCCGG GGGCGCGCAA CTTCGAGGTC GTGGTGCCGT CCGCCGGAAC CGGGCTGATC
CACTTCTTCC GCGACAACAA CCGCGCCGAC CCGGTGTGGA GCGGCCCCCG CCCGTTCGCA
GCGGAACTCG GGCATGTGGA CGCCGTCTCG ATGATCCAGA GCAACTACGA CGGGAACCTC
GAGGTGCTGG CCCGGGTCGC GAACCGGCTG TACCTGCTGT ACCGCTCAGG CGCCGCGGCC
GTGTGGTCGG CGCCGCGCCG CGTCTTCTGA
 
Protein sequence
MPSAAHRADN GASKRARAAG SRCHHQESDN NVDATRYNSA EHSLRKDTRL AMAGDVPADE 
RILITLDDPD PEADARRGLV GLSRNAAGGA EGELPDGSAG DVGGSGGTGG AGPSEISGPS
GPGDPGAQDV TEDPRHRERA IHRYGRVRVV GRPLATAAVA TARGVSAGTT PVDGTVVSTL
SLPGGLDRTE ALGVEAFRLR AGDDYRRRKR SRPRDAAPWD MNRPCTDIAP PRGSAGPGAT
PLAPTVRSLL NGLGSDASGP EATGSPSGAV GTGTAGAPSA GALSSYLEGS VAVGLILVEG
PTPALQLSAA ERTKIVAEVQ NGLSWYATQN PAAELTFSYD IQIVRLPTPA NPSANDLEAL
WRDPTMSRLG YAANFDGVYD YVEALRARLR TRWAYCAFVT KYPLGHFAYA SVGGPRMVLA
ADADGWGPDN IDRVFAHETG HIFGAPDEYG GAGCDCGGSW GRYGVPNGNC DSCAPAPVDC
LMRANTFALC RYTPAHIGWG HGVSGNPVLL QAKGLGVRGN FDVVAPSAYA GLTHVWRDND
AAGAPWRDPW QTAQALGRVD AVTMVQSTLA NPGPLEVAVR VGSRLFFLWR DSTGAFQWRA
PVQLAQGVGG VPSLVQSRLG GKGNFELLAP AADVGIMHMW RNHDVYGYPW SAPKLFAANL
GRVDAVSLIH GTLGGGAGML EAVALVGTRL VHLTRDQAAV WRTGGIFAEG ASGNPALIQS
VFPGARNFEV VVPSAGTGLI HFFRDNNRAD PVWSGPRPFA AELGHVDAVS MIQSNYDGNL
EVLARVANRL YLLYRSGAAA VWSAPRRVF