Gene Franean1_4665 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4665 
Symbol 
ID5673007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5568222 
End bp5569439 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content71% 
IMG OID641243522 
Productxylose isomerase 
Protein accessionYP_001508938 
Protein GI158316430 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2115] Xylose isomerase 
TIGRFAM ID[TIGR02631] xylose isomerase, Arthrobacter type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.025606 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.682712 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACGAC AGCCCACTCC CGAGGACAAG TTCTCCTTCG GCCTGTGGAC GGTCGGCTGG 
ACCGGCACCG ACCCGTTCGG CCTGCCGACC CGGACGGCCC TCGACCCGTG GGAGTACGCC
GACCGGCTGG CCGAGATAGG CGCCTGGGGC ATCACCCTGC ACGACAACGA CGTCTTCCCC
TTCGACGCCG ATGACGCCGC CGCCGCGCGG GCGTCCCGCC GGCTCAAGGA GGCCACCGAC
GCCTCCGGCC TGGTCATCGA GATGGTGACC ACGAACACCT TCACCCATCC CGTCTTCAAG
GACGGCGGCC TGACCTCGAA CGACCGCGGC GTGCGCCGGT TCGGCCTGCG CAAGGTGCTG
CGCGCGGTGG ATCTCGCGGC GCAGCTCGGC GCGACCACGT TCGTGATGTG GGGCGGCCGG
GAGGGCAGCG AGTACGACGG GTCGAAGGAC GTCTTCGCCG CGCTGGAGCG CTACCGGGAG
GGCCTGGACA CCGTCGCCGG CTACATCAAA AGCCAGGGCT ACGACCTGCG GATCGCGCTC
GAACCCAAGC CGAACGAGCC GCGCGGCGAC ATCCTCCTGC CCACCGTCGG GCATGCGCTG
GCGCTGATCG CCGAGCTGGA GAACGGCGAC ATCGTCGGGG TCAACCCGGA GACCGGGCAC
GAGCAGATGG CCAACCTCAA CTACACCCAC GCGCTCGGCC AGGCACTGTG GAGCGGGAAG
CTGTTCCACA TCGACCTCAA CGGGCAGCGG GGCCTGAAGT ACGACCAGGA CCTGGTCTTC
GGGCACGGCG ATCTCGTCTC GGCGTTCTTC ACCGTCGACC TGCTCGAGAA CGGCTTCCCG
GGCTACCCGG ACGGCCCCCG GTACACCGGT CCCCGCCACT TCGACTACAA GCCGTCGCGG
ACGGAGGGCA TGGCCGGGGT CTGGGAGTCG GCGCGGGCGT GCATGTCGAC CTACCTGCTG
CTCGCCGAGA AGGTGGCGGC GTTCCGCGCC GATCCCCTCG TCCAGGAGGC GATGGCCTAC
GCCGGCGTGT TCGAGCTGGC CAAGCCCACC CTCGCCCCCG GCGAGACCGC GGCCGATCTC
CTGGCCTCGG ACGACGGCTT CGACCCGGCC AAGGCGGCCG AGCGGGACTT CGGTTTCGTG
CGGCTCCAGC AACTGGCGAT CGAGCATCTC GTCGGCTCGC CCGCCCCGGG CTCGGCAGCC
GCCCGTTCCG CCGGCTGA
 
Protein sequence
MPRQPTPEDK FSFGLWTVGW TGTDPFGLPT RTALDPWEYA DRLAEIGAWG ITLHDNDVFP 
FDADDAAAAR ASRRLKEATD ASGLVIEMVT TNTFTHPVFK DGGLTSNDRG VRRFGLRKVL
RAVDLAAQLG ATTFVMWGGR EGSEYDGSKD VFAALERYRE GLDTVAGYIK SQGYDLRIAL
EPKPNEPRGD ILLPTVGHAL ALIAELENGD IVGVNPETGH EQMANLNYTH ALGQALWSGK
LFHIDLNGQR GLKYDQDLVF GHGDLVSAFF TVDLLENGFP GYPDGPRYTG PRHFDYKPSR
TEGMAGVWES ARACMSTYLL LAEKVAAFRA DPLVQEAMAY AGVFELAKPT LAPGETAADL
LASDDGFDPA KAAERDFGFV RLQQLAIEHL VGSPAPGSAA ARSAG