Gene Franean1_4673 details


Gene Information

Locus tag: Franean1_4673
Symbol:
ID: 5673015
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5581632
End bp: 5582798
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 70%
IMG OID: 641243530
Product: L-rhamnose isomerase
Protein accession: YP_001508946
Protein GI: 158316438
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4952] Predicted sugar isomerase
TIGRFAM ID: [TIGR02635] L-rhamnose isomerase, Streptomyces subtype
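
The coordinate and length fields in this record are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Start bp and End bp fields of this record.
start_bp = 5581632
end_bp = 5582798

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1167 388
```

Both results match the Gene Length (1167 bp) and Protein Length (388 aa) fields.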


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.239648
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.828695
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGGCAT TCAGCCAGAC GACAGAGGAC CTCGCTCGGC AGGAGATCGA GCTGCCCTCG 
TGGGCGTTCG GGAACTCCGG GACGCGCTTC AAGGTGTTCA CGCAGCGTGG CATCCCGCGT
GATCCGTTCG AGAAGGTCGC CGACGCGGCG CAGGTGCACC GGTTGACCGG GCTCGCGCCG
TCGGTCGCGC TGCACATCCC CTGGGACGTC GTCGACGACT TCGACAAGCT GGGCGAACAC
GCGCGGGCCA ACGGCGTCCG GCTCGGGACG ATCAACACGA ACACGTTCCA GGACGACGAC
TACCTGCTCG GCAGCCTCTG TCACGTCGAC GAGCGCGTCC GGGCGAAGGC GGTCAGGCAC
GCCCTCGACT GCGTCGACAT CATGGACGCG ACGGGCAGCC GCGACCTGAA GATCTGGCTG
CCCGACGGGC TGAACTACCC CGGCCAGGCC GACCTGCGCG ACCGGCAGGA GCGGCTCGCC
GACGCCCTGG GCCAGATCTA CGCCCGGCTG GCGGCGCACC AGCGGCTCGT GCTCGAGTAC
AAGCTGTTCG AGCCGGCCTT CTACGCCACC GACGTCCCCG ACTGGGGCAC GGCCTACGTC
CACTGCCTCG CGCTCGGGGA ACGTGCCGTG GTGTGTCTGG ACACCGGCCA CCACGCCCCG
CACACCAACA TCGAGTTCAT CGTGATGCAG CTGCTCCGGC TGGGGCGGCT GGGGGCGTTC
GACTTCAACT CCCGCTTCTA CGCCGACGAC GACCTGATCG TCGGCGCCGC TGACCCCTTC
CAGCTGTTCC GGATCATGAC CGAGGTCGTC CGGGGCGGTG GCTACGACGA GGGCAGCGAG
GTGACCTTCA TGCTCGACCA GTGCCACAAC ATCGAGGCGA AGATCCCGGG CCAGATCCGG
TCCGTGCTCA ACGTGCAGGA GATGACGGCC CGCGCGCTGC TCGTCGACCG GGCGGCGCTC
GCCGAGGCTG AGCGCGCCGG GGACGTGCTG GCCGCGAACG CGGTCCTCAT GGACGCCTTC
TACACCGACG TGCGCGCAGA CCTCGCCGCC TGGCGTGAGT CGCGCGGGCT GCCCGCCGAC
CCCCTGGCGG CCTTCCAGTC CAGCGGTTAC GCCGAGCGCG TCGCGGCCGA GCGGGTCGGT
GGCACCCAGG CCGGGTGGGG CGCGTGA
 
Protein sequence
MRAFSQTTED LARQEIELPS WAFGNSGTRF KVFTQRGIPR DPFEKVADAA QVHRLTGLAP 
SVALHIPWDV VDDFDKLGEH ARANGVRLGT INTNTFQDDD YLLGSLCHVD ERVRAKAVRH
ALDCVDIMDA TGSRDLKIWL PDGLNYPGQA DLRDRQERLA DALGQIYARL AAHQRLVLEY
KLFEPAFYAT DVPDWGTAYV HCLALGERAV VCLDTGHHAP HTNIEFIVMQ LLRLGRLGAF
DFNSRFYADD DLIVGAADPF QLFRIMTEVV RGGGYDEGSE VTFMLDQCHN IEAKIPGQIR
SVLNVQEMTA RALLVDRAAL AEAERAGDVL AANAVLMDAF YTDVRADLAA WRESRGLPAD
PLAAFQSSGY AERVAAERVG GTQAGWGA
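
The protein sequence corresponds to the gene sequence read in codons under translation table 11. The following is a minimal sketch of that translation, not the method the database itself uses. It assumes the codon-to-amino-acid assignments of table 11 are the same as in the standard code (the two tables differ in their permitted start codons), and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration. Only the first 30 bases and the final line of the gene sequence are used; the final line is in frame because it is preceded by 1140 bases.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ("*" marks a stop codon).
# Assumption: table 11 uses the same codon assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


gene_start = "ATGCGGGCAT TCAGCCAGAC GACAGAGGAC"  # first 30 bases of the gene
gene_end = "GGCACCCAGG CCGGGTGGGG CGCGTGA"       # final line of the gene

print(translate(gene_start))  # MRAFSQTTED
print(translate(gene_end))    # GTQAGWGA
```

The two outputs match the first ten and last eight residues of the protein sequence listed above.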