Gene Franean1_3397 details

Gene Information

Locus tag: Franean1_3397
Symbol:
ID: 5671768
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4026654
End bp: 4028168
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 71%
IMG OID: 641242285
Product: major facilitator transporter
Protein accession: YP_001507705
Protein GI: 158315197
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
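
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4026654
end_bp = 4028168

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon (3 nt) per residue, minus one terminating stop codon.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1515
print(protein_length)  # 504
```

Both results match the Gene Length (1515 bp) and Protein Length (504 aa) fields.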


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCTCCA GTGCCGACAG TGGCGGTTCC GCCCCGGTCG CTACCCGTCG TGTGGACGAG 
TCGCGCGTCA ACGCGATCAT CGCTGTCCTG GGCGGGATAG GCGTCGTCGT CGCGATGATG
CAGACGTTGA TGGTGCCGCT ACTGCCGACG CTGCCATCGC TGCTGCACAC CAGCTCGGCG
AACGCCTCGT GGGCGATCAC GGCCACGCTG CTCACCGCGT CCGTCGCCAA CCCGGTGTAC
GGGCGGCTCG GTGACCTCTA CGGCAAGCGG CGCATGGTCT TCGTCGCCGG CACCGCGCTC
GCCTGCGGCT CGGTGGTGTG CGCCCTGAGC AGCTCGCTCG TGCCGTTGCT GGTGGGCCGG
TCGATGCAGG GCCTCGGCAT GGCGATCATC CCGCTGGGCA TCAGCATCAT GCGTGACCTG
CTGCCGGCGA AGCGGCTGAT CCCCGCCATG GCGCTGATGA GCTCCTCGCT CGGGATCGGG
AGCGCGCTGG GCCTGCCGAT CGCGGCGGCG GTCGCCCAGC AGGCCAACTG GCACGTGCTG
TTCTGGGGCT CGGCTGTCGC CGTCGTCGCC CTGATGGTGC TGATCTGGCG GGTCGTTCCC
GAGTCGCCGG TCCGCGGCAC GGGCCGGTTC GACCTGCCGG GGGCGATCCT GCTCTCCGGA
GGGCTCGTCG CGCTGCTGCT CGCCGTGTCG AAGGGAAGCA CCTGGGGCTG GACCAGCACC
ACGACCCTCG GCCTGGGCAT GGTGGCCGCC GCCCTCCTCG TCGCCTGGAC CTGGTGGGAG
GCCCGCGCCG AGGCCCCCCT CGTGGACCTG CGCACCACCA TCCGGCGCCC GGTGCTGCTG
ACGAACACCG CTTCCGTTGC ACTGGGCTTC GCGATGTACG CGAACTCGCT GATCAACCCC
CAGCTGCTGC AGCTGCCGAA GGCCACCGGG CACGGGCTCG GCCAGTCGTT GCTCGCCACA
GGCCTGTGGA TGGCCCCCGT GGGGCTGGTG ATGATGGCCG TGTCGCCCAT CGCCGGCAGG
CTGATCACGG CACGCGGACC GAGGACCTCG CTCATCGCCG GCTCGGTCGT GATCGCCGGT
GGCTACTGCC TCGCACTTGG GCTCACCAGC AGCCCGCCGG GAGTCCTTCT CGTCAGCTGC
GTGATCAGCA CCGGCGTCGC ACTGGCTTAC GCGTCCATGC CCACTCTGAT CATGCAGTCC
GTGCCGGCCT CCGAGGGCGC CGCGGCGAAC GGCCTCAACA CCCTCATGCG CTCCATCGGA
ACCACGGGGG CGAGCGCGGT GATCGGCGTG GTCCTGGCGA ACATGACCAT CCCGTTCGGA
TCGACCCGGG TGCCCTCCCT CGCCGGCCTG CACGTCGGAT ACCTGATCGG CGCCGGTGCC
GCGCTGATCG CCGGCCTGCT GGCCCTCGGC ATCCCCGGCC GCGCGGCGTC GAAGTCGACC
GTCACGCTCC CGGAACCACG CCGGACGTCG CCGCAGGCGG CCCAGCCCGC GGCGACATCC
ACCTCAAGCG TCTGA
 
Protein sequence
MASSADSGGS APVATRRVDE SRVNAIIAVL GGIGVVVAMM QTLMVPLLPT LPSLLHTSSA 
NASWAITATL LTASVANPVY GRLGDLYGKR RMVFVAGTAL ACGSVVCALS SSLVPLLVGR
SMQGLGMAII PLGISIMRDL LPAKRLIPAM ALMSSSLGIG SALGLPIAAA VAQQANWHVL
FWGSAVAVVA LMVLIWRVVP ESPVRGTGRF DLPGAILLSG GLVALLLAVS KGSTWGWTST
TTLGLGMVAA ALLVAWTWWE ARAEAPLVDL RTTIRRPVLL TNTASVALGF AMYANSLINP
QLLQLPKATG HGLGQSLLAT GLWMAPVGLV MMAVSPIAGR LITARGPRTS LIAGSVVIAG
GYCLALGLTS SPPGVLLVSC VISTGVALAY ASMPTLIMQS VPASEGAAAN GLNTLMRSIG
TTGASAVIGV VLANMTIPFG STRVPSLAGL HVGYLIGAGA ALIAGLLALG IPGRAASKST
VTLPEPRRTS PQAAQPAATS TSSV
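
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which start codons they permit). The names `CODON_TABLE`, `translate`, and `gc_fraction` are hypothetical helpers written for this example, and only the first and last stretches of the gene sequence are used.

```python
# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 shares the standard code's codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(dna):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 nt and last 15 nt of the gene sequence shown above.
print(translate("ATGGCCTCCA GTGCCGACAG TGGCGGTTCC"))  # MASSADSGGS
print(translate("ACCTCAAGCG TCTGA"))                  # TSSV
print(gc_fraction("ATGGCCTCCA"))                      # 0.6
```

The two translated fragments match the start (MASSADSGGS) and end (TSSV) of the protein sequence. Applying `gc_fraction` to the full gene sequence would give the value to compare against the GC content field.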