Gene Franean1_2292 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2292 
Symbol 
ID5670691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2738443 
End bp2741040 
Gene Length2598 bp 
Protein Length865 aa 
Translation table11 
GC content74% 
IMG OID641241212 
Producthypothetical protein 
Protein accessionYP_001506633 
Protein GI158314125 
COG category[S] Function unknown 
COG ID[COG2170] Uncharacterized conserved protein
[COG2308] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02050] uncharacterized enzyme 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.382745 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.649548 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACG CGCGGATCGC GGCCGTCGGT GTTGAAGAAG AATTCCACAT TCTCGATCTC 
GCTACCAGGC AGCTGGTACC CCGCGCGGAG GAGATCCTGC GCGGCCTCCC CGACGACCAG
TTCTCCCCGG AGCTGCTGCG CTCCGTGGTC GAAACCAACA GCCGGCCCTG TACCGACCTC
TCCGACCTCC GGGCGGACCT GCTCGACCTG CGCCGCCGCC TCGCCGCGGT CGCCGAGCCG
CTGGGCCTCG GCCCCGCCGC GGCGGGAACG GTGCCGATCG TCGACATGTC CGTCCTCGAC
GTCTCGCGGG ACGCGCGCTA CATCCAGATG ACCGAGGAGT ACCAGCTCCT CGCGCGCGAG
CAGCTCATCT GCGGCGCCCA GGTCCATGTC GACGTCGCCG ACCGCGACCT GGCGATGGCC
GTCACCGCCT GGGTGGCCCC CTGGCTGCCG ATGCTGCTGG CGCTGTCGGC GAGCTCGCCG
TTCTGGCGCG GTGCCGACAG CGGCTACGCG AGCATGCGGA CGATGGTCTG GCAGCGCTGG
CCGACCGCCG GCGTGGCCGG GCCGTTCCGC ACCGCCGCCG AGTACGACCA GCTGGTGGCG
GACCTGGTGA AGTCGGGCGT CATCAGCGAC CCGGGCATGG TCTACTTCGA CGTCCGGCCG
TCCGCGCACC TGCCCACCGT CGAGCTGCGC ATCTGCGACG CCTGCCCGGA CGTCGACAAC
GTCATCCTGA TCGCCGGCCT GTTCCGGGCG CTGGTGTGCC GGGCGATCGA GGCGGTCGAG
GCGGGCGCAC CGGCTCCGCC GCCGCGCGCC GAGCTGCTGC GCGCGGCGAC CTGGCGCGCG
GCCCGCTCCG GCATCGAGAG CGACCTCGTG GACCTGTCGG GCACCGGCTG CATGCCGGCC
GAGGAGCTGC TGCGCCGGCT GCTCACCGAG GTCCGTCCCG ACCTGGAGAA GGTCGGGGAC
TGGGACCTGG TGCGTGACCT GGCCGAGGCG GCGGTGGGGC GGGGGAGCGC CGCGTCCCGC
CAGCGCCGGG CTTTCGCCCG CCGCGGCCTG CTGACCGACG TCGCGGACCT CGTGCTGGCC
GAGACGCGCG AGTCGCCGGC GTCGGCGGTC CACCCGCCGG GCACCGTCCC GTCGGTGGGC
GCGGTGGCAG CGCTGCCGCC GCTGCTGGAC CGCTACCAGC CGTCCGGGTT CGACGAGGTC
ATCGCCGAGG GCGGTGGTGT CCGGCCGCAC TACCGCGGCG TGGTGCGCAC CCTGGACCGC
CTCGGCCCGG CGGTGCTCGC CGAGCGCGGC GAGGCCATGC AGGCCGAGCA GGTCGAGCGC
GGCGTGGTGT TCCGGGTGAA CGGCGAGACC GAGCAGCGCC CGTTCCCGTT CGACCTGGTT
CCGCGGGTCG TCACCGCGGG CGACTGGGAG CGCCTGCAGT CCGGGCTCAC CCAGCGCGTG
CGGGCGCTGG AGGCGTTCCT GCGCGACACC TACTCCGAGC GCGCCGCGGT CGCCGACGGG
GTGATCCCGG CCTGGCTGGT CAACGACTCG CCGGGGCTGC GCCACTCCGG CCGGGTCCTC
TCCGGCGACG GGTCGCCGCG CTCCGGCGGT GTGCGGGTGA CGGTGGCGGG CATCGACCTC
GTCCGCGGCG CCGACGGCAA GTGGCTCGTG CTGGAGGACA ACCTGCGGGT GCCGTCCGGG
ATCGCCTACG CGATCGAGGG CCGGCGGCTG ACCCGGTCGG CGCTGCCCGA GCTCAACCCG
CCCGGCGCCA TCCTGGGGGT GGACGCGGTG CCGGCGCTGC TACACGAGGC GCTGGTCGCG
GCCGCCCCGC CGGCGGTGCG CGGCGAGCCC GCCGTCGCCG TCCTCACCGT CGGCGAGGAG
GACTCCGCCT ACTACGAGCA CACCTTCCTC GCCGAGGAGA TGGGGGTGCC GCTGCTGACC
CCGGCCGACA TCCTGGTCGA CGACGACGTG CTCTACGCCG TCGACGGCGG GCGGCGGCGG
CGGATCGACG TCCTCTACCG GAGGGTGGAC GAGGACGAGC TGACGGGGCT GCCGGGCGCT
GACGGGCTGC CACTGGGCCC CGGGCTGCTG CGCGCGGTGC GGGCTGGCTC GCTCGCGCTG
GCGAACGCGC TGGGCAACGG GGTCGCCGAC GACAAGGTCG TCTACGCCTA CGTCTCCCGG
ATGATCACCT ACTACCTGGG TGAGCAGCCG CTGCTCGACG ACGTCCCGAC CTATGTGTGC
GGTGATCAGG AGCAGTGCTC GCACGTCCTG GAGCACCTCG AGCAGCTCGT CGTGAAGCCG
GTGGACGGCT ACGGGGGCTC CGGGGTGGTG ATCGGCCCGC AGGCCGAGCC GTTCGAGCTG
ACCGAGGTCC GCGAGCGGAT CCTGGCCGAC CCGCGTGGCT GGATCGGCCA GGAGATGGTC
GCCCTGTCGA CCCATCCCAC CTGGGTCGAC GGTGAGCTCC AGCCGTGCGC GGTCGATCTG
CGGGCCTTCG TCTACGCCGG CCGCGAGACG GCGGTGGTCG CGCCGGCCGC CCTGAGCCGG
GTCGCGCCGC CGGGCAGCCT GATCGTCAAC TCGTCCCGGG GCGGCGGGTC GAAGGACACC
TGGCTGCTGC GTCCCTGA
 
Protein sequence
MSDARIAAVG VEEEFHILDL ATRQLVPRAE EILRGLPDDQ FSPELLRSVV ETNSRPCTDL 
SDLRADLLDL RRRLAAVAEP LGLGPAAAGT VPIVDMSVLD VSRDARYIQM TEEYQLLARE
QLICGAQVHV DVADRDLAMA VTAWVAPWLP MLLALSASSP FWRGADSGYA SMRTMVWQRW
PTAGVAGPFR TAAEYDQLVA DLVKSGVISD PGMVYFDVRP SAHLPTVELR ICDACPDVDN
VILIAGLFRA LVCRAIEAVE AGAPAPPPRA ELLRAATWRA ARSGIESDLV DLSGTGCMPA
EELLRRLLTE VRPDLEKVGD WDLVRDLAEA AVGRGSAASR QRRAFARRGL LTDVADLVLA
ETRESPASAV HPPGTVPSVG AVAALPPLLD RYQPSGFDEV IAEGGGVRPH YRGVVRTLDR
LGPAVLAERG EAMQAEQVER GVVFRVNGET EQRPFPFDLV PRVVTAGDWE RLQSGLTQRV
RALEAFLRDT YSERAAVADG VIPAWLVNDS PGLRHSGRVL SGDGSPRSGG VRVTVAGIDL
VRGADGKWLV LEDNLRVPSG IAYAIEGRRL TRSALPELNP PGAILGVDAV PALLHEALVA
AAPPAVRGEP AVAVLTVGEE DSAYYEHTFL AEEMGVPLLT PADILVDDDV LYAVDGGRRR
RIDVLYRRVD EDELTGLPGA DGLPLGPGLL RAVRAGSLAL ANALGNGVAD DKVVYAYVSR
MITYYLGEQP LLDDVPTYVC GDQEQCSHVL EHLEQLVVKP VDGYGGSGVV IGPQAEPFEL
TEVRERILAD PRGWIGQEMV ALSTHPTWVD GELQPCAVDL RAFVYAGRET AVVAPAALSR
VAPPGSLIVN SSRGGGSKDT WLLRP