Gene Franean1_6826 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6826 
Symbol 
ID5675139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8318701 
End bp8320272 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content72% 
IMG OID641245675 
Productmajor facilitator transporter 
Protein accessionYP_001511066 
Protein GI158318558 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCCA GCGAAGTCTC GAACGGAGCG AGCCCGGACT CCGGCCCGGG CTCCGGAGCC 
GGGACGGACG CTCCGTCACG AGCCAGCGCC ATCGTCGCCG TCCTGGCCGC GGTCGGCGTC
CTGGTCTCGC TCATGCAGAC GCTGATGGTG CCGCTGATCC CGGTACTGCC GAAGCTCCTG
CACTCCAACG CGAGCGACGC GTCCTGGGCC ATCACGGCGA CCCTGCTCAC CGGGGCCGTC
GCGAACCCGG TCTTCGGCCG GCTCGGCGAC CTGTTCGGCA AGCGGCGGAT GCTCCTGCTC
TCCGGCTACA TCCTCGTGGC GGGCTCGCTG GTCTGTGCCC TGACCGACTC CCTGGTGCCG
ATCGTGGCAG GCCGGGCCCT GCAGGGCCTC GGCCTGGCGA TCATCCCGCT GGGCATCAGC
ATCATGCGTG ACCTTCTCCC GCCGAAGCGG CTGATCCCGG CCATGGCCCT GATGAGCTCG
TCGCTCGGCA TCGGCGGCGC GCTGGGACTG CCGATCGCGG CGATCGTCGC GCAGAACCTC
GACTGGCATG TGCTGTTCTG GGGTTCGGCC ATCGCCACCC TGATCCTCGT GGCGCTGGTC
ACGGTCGTGG TCCCCGAGTC CCCCGTCCGG GGTTCCGGCA GCTTCGACCT GCCCGGAGCA
GTGGCCCTCT CCGCGGGCCT CGTCGCGCTG CTGCTCGCCG TGTCGAAGGG AAGCACCTGG
GGCTGGTCCA GCGCCACCAC CCTGGGGCTG TTCGGAGCCG CGGTCGCCGT CCTGCTGGCC
TGGGGCCGGT GGGAGACCCG CGCGAAGGCC CCGCTGGTCG ACCTGCGCAC CTCGACCCGG
CGCCCGGTGC TCCTGACGAA CCTGTCCTCC ACCGTGCTGG GCTTCGCGAT GTACGCGATG
TCGCTGATCT GCCCGCAGAT CATGCAGCTA CCCAGGGCCA CCGGGCACGG CCTCGGCCAG
TCACTGCTCG CCACCGGCCT GTGGATGGCG CCGGCGGGGC TGATGATGAT GGTCGTCTCG
CCCTTCGCTG GACGCCTGAT CACCGCCCGC GGGCCGAAGG TCGCCCTCCT CTCCGGCACA
GCTGTGATGA CCGTCGGATA CGTCGCCGCG CTCGGGCTGA TGGGCAGCCC CGTGGGCGTC
CTGGTCATCG CCTGTTCGAT CAGCGGCGGC GTGGGGCTCG CCTACGCGGC GATGCCAACC
CTGATCATGG CCTCGGTGCC CGCTTCCGAA GGCGCCGCCG CCAACGGCCT CAACACCCTG
ATGCGCTCCA TCGGGACGTC GACGGCCAGT GCCGTGATCG GCGTCGTGCT GGCGAACATG
ACCATCTCCT TCGGGACGAC GCAGGTCCCG TCACTGACCG GCCTGCGCGT CGGCTTCCTG
ATCGGCGCCG GCGCCGCACT GGTGGCCTTC CTGGTAGCCC TCGCCATCCC GGCCCGCAAG
TCGGCCGCAC CCGCCTCCGT CGTTCCCGAC CAGCGCAGCC CGCACGACCG GTCGACCGGG
GCCGCTGGGG CCGCCGCCGG TTCCGTGGCG GAGGGCGCCG CAGCGACGGA CGCGGTCGAG
GCAAGGGCCT GA
 
Protein sequence
MDASEVSNGA SPDSGPGSGA GTDAPSRASA IVAVLAAVGV LVSLMQTLMV PLIPVLPKLL 
HSNASDASWA ITATLLTGAV ANPVFGRLGD LFGKRRMLLL SGYILVAGSL VCALTDSLVP
IVAGRALQGL GLAIIPLGIS IMRDLLPPKR LIPAMALMSS SLGIGGALGL PIAAIVAQNL
DWHVLFWGSA IATLILVALV TVVVPESPVR GSGSFDLPGA VALSAGLVAL LLAVSKGSTW
GWSSATTLGL FGAAVAVLLA WGRWETRAKA PLVDLRTSTR RPVLLTNLSS TVLGFAMYAM
SLICPQIMQL PRATGHGLGQ SLLATGLWMA PAGLMMMVVS PFAGRLITAR GPKVALLSGT
AVMTVGYVAA LGLMGSPVGV LVIACSISGG VGLAYAAMPT LIMASVPASE GAAANGLNTL
MRSIGTSTAS AVIGVVLANM TISFGTTQVP SLTGLRVGFL IGAGAALVAF LVALAIPARK
SAAPASVVPD QRSPHDRSTG AAGAAAGSVA EGAAATDAVE ARA