Gene Franean1_4531 details

Gene Information

Locus tag: Franean1_4531
Symbol:
ID: 5672880
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5405484
End bp: 5406854
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 66%
IMG OID: 641243396
Product: general substrate transporter
Protein accession: YP_001508812
Protein GI: 158316304
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
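
The coordinates and lengths in this table are consistent with each other if the start and end positions are read as 1-based and inclusive. That convention is an assumption here, not something the record states. A minimal sketch of the arithmetic, with illustrative variable names:

```python
# Values copied from the Gene Information table; names are illustrative only.
start_bp = 5405484
end_bp = 5406854
gene_length_bp = 1371
protein_length_aa = 456

# Assumed convention: 1-based inclusive coordinates, so length = end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS ends with a stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons = gene_length_bp // 3
residues = codons - 1

print(span == gene_length_bp, residues == protein_length_aa)
```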


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCTAC AAGATGTACC TGCGCGGGGT CCAGCCTCCG ACCAGTCTGT CGCCAGCGAT 
AACGGGTCGG ACCCGAAGAA GATGCGCAAG GTCGCCCTGG CGAGCTTAAT GGGAACCGTC
GTCGAGTTCT ACGATTTCGG TATTTACGCC ACCGCGGCGG CGCTGGTCTT CGCGGATGCG
TTCTTCCCGG CCCTTGGTGG CCATTCCGGT ACCGTTGTTT CGTTCGCGAC CCAGGGCGTC
GCGTTCGTCG CCCGCCCACT GGGGTCAATC CTGTTCGGGC ACTTCGGCGA TCGGCTCGGG
CGCAAGCGGA CTCTGATCTT CACGATGTCG CTGATGGGCG TGTCCACCGT GCTCATCGGC
GCGCTGCCCA CGGCGGGGAG CATCGGGGTC GCGGCACCCA TTCTGCTCGT GCTGCTCCGC
GTCGCGCAGG GCATCGCCGC CGGTGGCGAG TGGGCGGGTG CGACGCTGTT CACCTCCGAG
CATTCACCGA AGGGCCGTCG CGGCTTCTGG TCCATGTTCA CCAATCTCGG CGGCGCGCTC
GCGAACATCC TCGCCCTGTC CACGTTCCTC GCCGTGGCGC TCTACATGAG CGACGAGACC
TTCACGTCCT GGGGATGGCG CCTCCCGTTC CTGGGCAGCT TCGTGCTTAT CGCCGTCGGT
CTCTACGTGC GGCTGAAGAT CGAGGAGACC CCCGCGTTCG AGGCCGAGGC GAAGCGTCAG
CGCAGTGGTC GTGGGCTGCC GTTCAAGGAG GCTGTGGTCA ACCAGTGGAA GGAGCTCCTG
CTCGGAGCGG GCGCGTTGGT GACGGCCTTC TCGCTCGGCT ACATCGGGAT CGCATACCTG
ACCCACTACG GCACCGCCAC GCTGGGCCTG AGTCGACCCG AAGTGCTGAC GGCCGGCATT
GTCGGCAATG TTGTGAATGG TTGCGCAATC ATCTCGGGCG GCATTCTGAG TGACCGGTTC
GGCCGTCGGC GGGTCCTGCT GGCCGCCAAC ACCGTCGGCA TCCCCTGGGC GCTGGTCCTG
TTTCCGCTGC TGGACACGGG CACACTTACC GCCTTCTGGG TCGGGATGGC GGTGACCTTC
CTGATCGCCG GTCACGGATT CGGCGTCGCG GGGTCGTTCC TGTCCGAGCT GTTCCACACC
CGCTACCGCT ACACCGCGGC CGGGCTCTCC TACAGCCTCG CGGGCGTTGT CGGCGGCGCG
GTACCTCCGC TCGTCGCCGC CAGCATCATC GGAAACCACG GCGGATTCGT GTTCGGCCTG
TTTTTGGCCG CCTACTGCGT GGTGAGCCTG CTGTGCGTAC TGGCTCTGCG GGAGACCGTC
GGCAACGAAA TGATCGAGAC GCCCGCAGCG CAGCAGGGGG TGACATCGTG A
 
Protein sequence
MTLQDVPARG PASDQSVASD NGSDPKKMRK VALASLMGTV VEFYDFGIYA TAAALVFADA 
FFPALGGHSG TVVSFATQGV AFVARPLGSI LFGHFGDRLG RKRTLIFTMS LMGVSTVLIG
ALPTAGSIGV AAPILLVLLR VAQGIAAGGE WAGATLFTSE HSPKGRRGFW SMFTNLGGAL
ANILALSTFL AVALYMSDET FTSWGWRLPF LGSFVLIAVG LYVRLKIEET PAFEAEAKRQ
RSGRGLPFKE AVVNQWKELL LGAGALVTAF SLGYIGIAYL THYGTATLGL SRPEVLTAGI
VGNVVNGCAI ISGGILSDRF GRRRVLLAAN TVGIPWALVL FPLLDTGTLT AFWVGMAVTF
LIAGHGFGVA GSFLSELFHT RYRYTAAGLS YSLAGVVGGA VPPLVAASII GNHGGFVFGL
FLAAYCVVSL LCVLALRETV GNEMIETPAA QQGVTS
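
The gene and protein sequences above can be checked against each other, and against the GC content field, with a short script. The sketch below is not the database's own tooling. It assumes the amino-acid assignments of NCBI translation table 11, which match the standard code, and it ignores that table's alternative start codons. The function names `translate` and `gc_fraction` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); table 11 uses the same
# amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence, as printed in the record.
first_row = "ATGACGCTAC AAGATGTACC TGCGCGGGGT"
last_row = "GGCAACGAAA TGATCGAGAC GCCCGCAGCG CAGCAGGGGG TGACATCGTG A"
print(translate(first_row))  # MTLQDVPARG
print(translate(last_row))   # GNEMIETPAAQQGVTS
```

Passing the full gene sequence to `translate` and to `gc_fraction` gives values to compare with the protein sequence and the GC content field in the record.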