Gene Franean1_5187 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5187 
Symbol 
ID5673521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6227480 
End bp6228637 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content70% 
IMG OID641244041 
Productsolute-binding protein 
Protein accessionYP_001509451 
Protein GI158316943 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4213] ABC-type xylose transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0173275 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAGACA ACTCGCTCGA CGACGGGGTA CCTCGGCGCG GTCGGATGCG CGGCCATGCG 
GGGAGACGGC GAGGTCGCCT TTTCGTGGCC GCGTCGGTGG CGGTCATGGT CGTGCTGGCC
GCCGTGGGAT GCACATCGGA CTCGTCCGAT GACGAGGTGC CCCAGGGAGG ATCCGAGAGC
GGAACCATCG CGCTCCTGTT ACCCGAGACC CAGACGACCC GCTACGAATC GGCCGACCGC
CCCTACTTCG AGGCGCGGAT GGCGAAGATC TGCCCCGACT GCAAGGTGCT GTACTCGAAC
GCCGACCAGG ACTCGGCCGC CCAGCAGAAC CAGGCCGAGC AGGCCATGAC CAACGGCGCC
AAGGTCCTCG TCCTGGACCC GGTGGACGGC GAGGCCGCGG CGGTGATCGC CCGCAATGCG
CGGGACCGTG GCGTGCGCGT GGTCTCCTAC GACCGGCTCA TCCAGAAGGC GCCCGTCGAC
GCCTACATCT CCTTCGACAA TGAGAAGGTC GGCCAGTTGC AGGGCCAGGC GCTCCTCGAC
GCGATCGGCG ACCGGGCCGG CGCCGGCAAG GTCATCATGA TCAACGGCTC GCAGGACGAC
CCGAACGCCC AGCAGTTCAA GGACGGCGCG CTGTCGGTCC TGGAGGGCAA GGTGACGATC
GGCTTCGACA CGTTCACCCC CGACTGGTCT CCCGACACCG CCGGTCGGGA GATGGACCAG
GCGATCACCA CCGTCGGCCG GGAGAACATC GTCGGGGTCT ACGCCGCGAA CGACGGCATG
GCCGGCGCCG TGGTCGCCGC GCTGCGCCGG GCGAACGTGA ACCCGCTGCC GCCCGTCACC
GGCCAGGACG CCGAACTCGC CGGGGTACAG CGCGTACTCG CCGGAGATCA GCACATGACC
GTCTACAAGG CCATCCGCCC CGAGGCGGAG CAGGCGGCCG ACCTGGCGCT CGCGCTGCTG
CGCGGTGAGC CCGTCGACAC GATCGCGACC GGGCACGTCG ACAACGGCAA CGGCCAGGTT
CCCGCCGTCC TGCTGGAACC GGTCGCGGTC ACCCGGGACA CCGTCGCCGC GACGGTGGTG
AAGGACGGCT TCATCGCCAA GGCCGACCTG TGTGCCGGCA CGTACGCGAC AGCCTGCGCG
TCCGCCGGCA TCTCCTGA
 
Protein sequence
MADNSLDDGV PRRGRMRGHA GRRRGRLFVA ASVAVMVVLA AVGCTSDSSD DEVPQGGSES 
GTIALLLPET QTTRYESADR PYFEARMAKI CPDCKVLYSN ADQDSAAQQN QAEQAMTNGA
KVLVLDPVDG EAAAVIARNA RDRGVRVVSY DRLIQKAPVD AYISFDNEKV GQLQGQALLD
AIGDRAGAGK VIMINGSQDD PNAQQFKDGA LSVLEGKVTI GFDTFTPDWS PDTAGREMDQ
AITTVGRENI VGVYAANDGM AGAVVAALRR ANVNPLPPVT GQDAELAGVQ RVLAGDQHMT
VYKAIRPEAE QAADLALALL RGEPVDTIAT GHVDNGNGQV PAVLLEPVAV TRDTVAATVV
KDGFIAKADL CAGTYATACA SAGIS