Gene Franean1_0630 details

Gene Information

Locus tag: Franean1_0630
Symbol:
ID: 5669047
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 729666
End bp: 731117
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 74%
IMG OID: 641239557
Product: extracellular solute-binding protein
Protein accession: YP_001504995
Protein GI: 158312487
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
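
The start, end, gene length, and protein length values above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(729666, 731117)
    print(length, "bp")
    print(protein_length(length), "aa")
```

Under these assumptions the computed values agree with the listed gene length of 1452 bp and protein length of 483 aa.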


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.679852
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGGGA ACGTGTCAGA ACCCCGCCCG CGGCACCTGG ACCCGCGAGG CCCCGGCACC 
CGCCCGGGCC CGGGCACCCG CCGGGCCGGC TCGTCCCCAC GCCGCGCCGG CCCGCTGGCC
GCGCTGCTCG CCGGGACGGT CACGGTCGGC CTGCTCGCGG GCTGCGGCGG CGGCTCCGGC
GCGGCCACCG ATCCGGCCGA GAGCCTGCGT CCGACGGCGC GGCCGGCGAC CGCGAACGTG
GACGACGTCG CCGGCGCTAA GGCGTCGCCG GAGTGCGCCG CCGCGGTGAA GACGCTGCGG
ATGTTCGCCC TGGGGTCGCT CAACGACGCG GCGAAGTCGG GCAAGGCGTA CATGGAGAAG
GCCCATCCCG GCCTGACCGT CGAGCTCACG GCCGACGCGA CCGGCTACCC CCAGCTCGTC
CAGCAGATCA GCGCGGACCG GGCCGCCGGG CGCCCGGCGG ACGTCGCGGT CGCCGGCTTC
GACCTGCTGC CGACCTTCGC CGACAAGCTC GGCGCGCAGC CGCTGTCGCC CCGGTTGCTG
CGGGCGTCCT ACGACCAGCG GTTCCTGCCG CTCGGCGAGT ACGGCGGCCG GCTCGTCGCG
GTGCCGCAGC AGGTGTCCAC GCTCGCACTC GTCTACAACG CGGACGTCCT GGCGAAGGCC
GGGGTCGACC CGAAGACGCT GGGCACGACC ACGGGGGTGC TCGCCGCGGC CGAGCGGATC
AGGAACTCCG GTCAGCAGAT CCAGCCGATC GACCTGCCGA CGGGCGGGTT CGCCCAGTGG
TACCTGACCA CGCTGGCCAG CTCGAAGAAC ACCCCCGCGA TGAAGGCGGA CGGCCAGCCC
GACCTGACCA GCCCGGCCGT CCGCGAGGCC GCCGCGTTCC TGGCCAAGGT GGGCACCTAC
GGAACACAGT CGAGCGACCC GACCACGCAG GGCCTGCTGC GCTTCGGCAT CCGGCACGAG
ACGGCGATCA GCGCGGTCAC CCTGCCGTCG CTGGCCGCGG GGCTGCGCTA CGTCCATGAC
CAGGGCGCGC AGGGCTTCAA GGTGGGTGTC GCCCCGTTCC CGACCCTGCC CGGGGGCACT
CAGCACCCCG TCGCGGGCGG CAACGGGCTG TCCGTACTGT CGACGGACCG CTGCCAGCGG
GAGATGGCCA CCGAGCTGGT CGTCGCGCTG CTCGCCCCGG ACGTGATCGC CGCCGGCACC
GAGGCGTTCA GCTTCCTGCC GGTCGACACC GAGGCCCGCA GGCAGCTCGC GCCGTTCTAC
CGGGAGTTCC CGGAGCTGAC CCAGTTCGAC GCGCTGATTC CGGATCTCGT CCAGGCGCCG
ACCTGGGGCG GTGAGCGCGG CGGCGAGGTC CACGACGCGC TGAACGACGA GGTGGTCGCA
ATCATGTCAG GAGCCGACCC CGGCACCACC CTCACCGAGG CCCAGCGGAA GATCGCCACC
CTGGTGAAGT GA
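
The GC content reported for a gene is the fraction of G and C bases in its sequence. A minimal sketch of how that fraction could be computed from a sequence printed in space-separated blocks of ten bases, as above, follows. The function name `gc_content` is a hypothetical helper, and the sketch assumes the input contains only A, C, G, T and whitespace.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a DNA sequence; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used as a small illustration.
    print(gc_content("ATGCTCGGGA ACGTGTCAGA"))
```

Passing the full gene sequence, with its spaces and line breaks, gives the whole-gene value, which can be multiplied by 100 and rounded for comparison with the GC content field.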
 
Protein sequence
MLGNVSEPRP RHLDPRGPGT RPGPGTRRAG SSPRRAGPLA ALLAGTVTVG LLAGCGGGSG 
AATDPAESLR PTARPATANV DDVAGAKASP ECAAAVKTLR MFALGSLNDA AKSGKAYMEK
AHPGLTVELT ADATGYPQLV QQISADRAAG RPADVAVAGF DLLPTFADKL GAQPLSPRLL
RASYDQRFLP LGEYGGRLVA VPQQVSTLAL VYNADVLAKA GVDPKTLGTT TGVLAAAERI
RNSGQQIQPI DLPTGGFAQW YLTTLASSKN TPAMKADGQP DLTSPAVREA AAFLAKVGTY
GTQSSDPTTQ GLLRFGIRHE TAISAVTLPS LAAGLRYVHD QGAQGFKVGV APFPTLPGGT
QHPVAGGNGL SVLSTDRCQR EMATELVVAL LAPDVIAAGT EAFSFLPVDT EARRQLAPFY
REFPELTQFD ALIPDLVQAP TWGGERGGEV HDALNDEVVA IMSGADPGTT LTEAQRKIAT
LVK
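
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates that mapping. It assumes the amino acid assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ in which codons may serve as start codons), and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The function name `translate` is a hypothetical helper.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence up to (not including) the first stop codon.

    Whitespace is ignored, so sequences printed in blocks of ten bases can be
    passed in directly.
    """
    dna = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence.
    print(translate("ATGCTCGGGA ACGTGTCAGA ACCCCGCCCG"))
```

Translating the first thirty bases of the gene sequence this way yields the first ten residues of the protein sequence, and the final codons CTG GTG AAG TGA yield the closing LVK followed by the stop.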