Gene Franean1_5357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5357 
Symbol 
ID5673691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6461468 
End bp6462817 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content67% 
IMG OID641244215 
Producthypothetical protein 
Protein accessionYP_001509621 
Protein GI158317113 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTGG CGCAGGATGT GGACGGAGCA ACAGACGGGT CATCCGGGGC GGACGGGCTG 
ACCGTTCGCT TGACGGCGTG GGAGGCCCAA GTCAACGCGC TCGCAGTCCT GCAGCTGTGT
GCGACCGGCA AGCTGCGGTG CAGTGAGAAG ACACAGCGGC CGGGTGCGGC CACCGTCACG
GCGGTCGCCG GAGTGTTGCT CCGTGGCGAT TTCTATCTCA CCGAGCCGAT CGCGGCGTTC
GCCTGGCCGC TGTTGCTGCA GGCCGGTGGG CTGGCAGAGG TTAGCGGGGG ACGTCTGGTG
CTGACAGCGC GGGGCCGGTC GGTGCTGGCC AAACCGGCGG CGGAGACCAT CCGGCAGCTG
TGGAGGCGCT GGGTCGGCCA TGCGGTCCTC GACGAGATGA GCCGGATCGA GACGATCAAG
GGTCAGCGGT CCGCCCGGGC GCTGACCGCG GCAGGTCCCC GGCGCAAGGC CGTTGCCGAG
GCGCTCGCGG CCTGCCCGCC AGGCGAGTGG GTGGCGGTGG ACATGCTTTT CACCCGGATA
CGGCGTGGAG GACCGCCGTT CACCGTCGCA CGTGACGTCT GGAAGCTGTA CATCGAAGAT
CCGCAGTACG GCAGCCTGGG ATATGACGGG TTCCACGACT GGCCGCTGCT GGAGGGCCGG
TATCTGCTGT GTCTGCTGTT CGAGTACGCG GGCACGCTTG GCCTCTTTGA TCTCACCTAC
GGCGATCCAG TTGACGCCCG TGACGATTTC CGCGAGAACT GGGGCGCTGA CGACCTGGAC
TTCCTCAGCC GGTACGACGG TTTGCGTGCC GTGCGGATGA ACGCCCTCGG CGCCTATGCG
TTCGGCCTGG CTGACCGGTA CGAGCCCGAG GGGGACAGCA CCGCAGCCGG TCCAGCGCTC
AAGATCCTGC CGAACCTGGA CATCGTCATC ACCGGTGAGG TCCAACCGGC CGACGAACTC
CTCCTGAGCG TTTACGGGCA GCGCGCCTCC GATCGCATCT GGTCCCTGAC CATCAAGTCG
TTGATGGCCG CGATCGACGC GGGTAGACCA GTCGAGGAGC TGCGGCGGTT CCTGACCGAC
CGCGCGGACC ATGAGCTACC GGCCACCGTT GCCAAGCTCT TCGAGGACGT CACCGCCCGC
GCCGGCCAAC TACGGGATCT CGGGTTGGCG CGAGTGATCG AATGTGTCGA CCCGGCGGTG
GCCGCCCTCA TCGCTAACGA CCGCAAACTC CGCCGACTCT GCACCCTGGT AGGCGATCGT
CACATCGCTG TACCCTTGCA GCACGAGGCC GATTTCCGCG CAGCCCTGCG GAAGCTGGGA
TACGCCATAC CTTCCTCGCC AGGGGCGTAG
 
Protein sequence
MTVAQDVDGA TDGSSGADGL TVRLTAWEAQ VNALAVLQLC ATGKLRCSEK TQRPGAATVT 
AVAGVLLRGD FYLTEPIAAF AWPLLLQAGG LAEVSGGRLV LTARGRSVLA KPAAETIRQL
WRRWVGHAVL DEMSRIETIK GQRSARALTA AGPRRKAVAE ALAACPPGEW VAVDMLFTRI
RRGGPPFTVA RDVWKLYIED PQYGSLGYDG FHDWPLLEGR YLLCLLFEYA GTLGLFDLTY
GDPVDARDDF RENWGADDLD FLSRYDGLRA VRMNALGAYA FGLADRYEPE GDSTAAGPAL
KILPNLDIVI TGEVQPADEL LLSVYGQRAS DRIWSLTIKS LMAAIDAGRP VEELRRFLTD
RADHELPATV AKLFEDVTAR AGQLRDLGLA RVIECVDPAV AALIANDRKL RRLCTLVGDR
HIAVPLQHEA DFRAALRKLG YAIPSSPGA