Gene Franean1_5931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5931 
Symbol 
ID5674252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7203844 
End bp7205205 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content77% 
IMG OID641244779 
Producthypothetical protein 
Protein accessionYP_001510181 
Protein GI158317673 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0524] Sugar kinases, ribokinase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.090162 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00187432 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAAGGCG TCTTCGTCGG CCTGACGACG CTTGATTCGG TCTACCTCGT GGATCGGCTT 
CCAGGTGCCG ACGAGAAGTG TGTGGCGCGG GATTTCGCCA TGAATGCGGG CGGGCCCGCG
ACGAACGCCG CGGTGACGTT CGCCTACCTC GGCGGGCGGG CCTGCCTCGT GAGCGCCATC
GGCACGTCCC CGGCCGCGGC GCTCGTGCAT GCGGATCTCG CGCGCTACGG GGTCCGGCAC
ATCGAGCTGG TGCCGGAGCA GTCCGGCGAC GGTGCCGCCG GTCCCTACGG AGCCCTCGCC
GCGCAGGGAG TGGCCGGTCC CGCCGGCCAG CACGGCCCGC CCGGCAAGCC CACCGGGCTG
CGCTCCGGCG GCACCCGCTC CGGCTACGGC CCGGCGGGAC CGCTCACCGG ATACCCGCAC
GTGCGCGGCC GGGGCGGGCC CGGCCTGCCC GGCGGCGCGA CCCACAGCCC CGCCGGGGCG
GTCAAGCCAG CCGGCCACGT GCACGGCCCG GTCGGCGGCG CCCACGGCGG CGGGCACGGT
GGGCACGGCA GCGGACCCGG CGGTGGCCAC GGTGGGCCTG GCGGTCACGG TGCGATCGGC
GGTCACGGCG GCCACGGCGG GATCGGCGGC CACGGGGTGG CCGGCGGGGC CGGGCTCGGT
GGAGCCGGCG TCCTCGGCGG GATGGCCGGC CTGGCCGGTG CCGTCCAGAC CGGGCCGAAC
ACGGCGGCCG CCGGCCACCT GGCCGGCCAG TCGGCGATGT CCTACGCCCT CCCGATGTCA
GCGGTCATGG TGACCTCGCA GACCGGTGAG CGTGCGGTGA CCTCGACGCA CGGCATGGTC
CCCCGGTGCA CCGCGAACAC CTCCGCCGCG GCCGCGGTCG CCGACGCCGA CGTGGTCGTG
CTCGACGGCC ACCAGGTCGA CGCCGCCATC GGCCTGCTCC GCACGCTGCG CGGCTCAGGC
CCGCCGGTCC TCCTCGACGG CGGAAGCTGG AAGCCCGGCA CCGAGCAGAT CCTGCCGTTC
GTCGACGTCG TGATCTGCTC GACCGCGTTC CGCCCCCCGG GCTTCGACCC CGGCGCGGAC
ATCCTCGGCC TGCTGCTGCG CTACGGGCCG TTCTTCGTCG CCGTCACGGA CGGCCCCGGG
CCCATCCGCT GGGCCACGGC GGAACGGCGC GGCCACGTGC TGCCCCCGGT GGTGGCCGCC
CGCGACACCC TGGGCGCCGG CGACGTCTTC CACGGTGCCT TCGCCTGGAT GATGGCCCAC
GGCGCGCTCG CGACCGACGA GCTGGTCGGA GCCCTCGGCG AGGCCTCACG GGTCGCCGCC
CGCTCCGTCC AGACCTTCGG CCCCCGCAGC TGGATGACCT GA
 
Protein sequence
MKGVFVGLTT LDSVYLVDRL PGADEKCVAR DFAMNAGGPA TNAAVTFAYL GGRACLVSAI 
GTSPAAALVH ADLARYGVRH IELVPEQSGD GAAGPYGALA AQGVAGPAGQ HGPPGKPTGL
RSGGTRSGYG PAGPLTGYPH VRGRGGPGLP GGATHSPAGA VKPAGHVHGP VGGAHGGGHG
GHGSGPGGGH GGPGGHGAIG GHGGHGGIGG HGVAGGAGLG GAGVLGGMAG LAGAVQTGPN
TAAAGHLAGQ SAMSYALPMS AVMVTSQTGE RAVTSTHGMV PRCTANTSAA AAVADADVVV
LDGHQVDAAI GLLRTLRGSG PPVLLDGGSW KPGTEQILPF VDVVICSTAF RPPGFDPGAD
ILGLLLRYGP FFVAVTDGPG PIRWATAERR GHVLPPVVAA RDTLGAGDVF HGAFAWMMAH
GALATDELVG ALGEASRVAA RSVQTFGPRS WMT