Gene Franean1_1259 details


Gene Information

Locus tag: Franean1_1259
Symbol:
ID: 5669672
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1516337
End bp: 1517812
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 73%
IMG OID: 641240191
Product: major facilitator transporter
Protein accession: YP_001505619
Protein GI: 158313111
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
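
The start, end, gene length, and protein length listed above agree with one another, which a little arithmetic confirms. The sketch below assumes that the coordinates are 1-based and inclusive and that the CDS span ends with a stop codon; the record states neither convention, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1516337
END_BP = 1517812


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_nt: int) -> int:
    """Amino acid count, assuming the final codon of the CDS is a stop."""
    if cds_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_nt // 3 - 1


length_nt = gene_length(START_BP, END_BP)
print(length_nt, protein_length(length_nt))  # 1476 491
```

Under these assumptions the computed values match the listed 1476 bp and 491 aa.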


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0884652
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0333983
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCGTT CGGACCAGGT CACGTCGAGG ACGAGCGCAG GAAGCGCGGA GGAGGCAGGC 
ACCCGGGTGC CGTCGTCCAC TGGCACCTTC GCCGCCCTCC AGGTACCGAA CTTCCGCCTC
TTCCTCGGCG GTCAGGTGGT CTCGCTGTGC GGGACCTGGA TGCAGATGAT CGCCCTGGGC
TGGCTGGTGC TGTCACTCGG CGCGTCCGGC ACCGAACTCG GGCTCGTCAC CGCGGCCCAG
TTCCTGCCCG TGCTGCTGTT CGGCGCCTAC GGCGGGCTGA TCGCCGACCG CTCGAACACC
CGCAGGCTCC TGATCACCAC TCAGATCATT CTCGGTTCCC TCGCCGTCCT GCTCGGCATC
CTGGACCTGA CGGGCACGGC ACGCCTGTGG ATGGTCGCCG CCGTCGCCGC GGCGATCGGG
ATGACCAGCG CGGTGGACAA CCCGGCGCGG CAGAGCTTCG TCCAGGAGAT GGTGGGATCG
GAGTTCCTGC CCAACGCCGT CACGCTGAAC TCGGTGACCA TGAACGCGGC CCGGGTCGTC
GGGCCCGGCA TCGCCGGCAT CCTCATCAGC CTGGTCGGCA CCAGCGGCTG TTTCCTGCTG
AACGGCGCCT CGTTCGTCGC CGTGGTCATC GCGCTCCAGC GGATCGACAC CGCGGCGCTG
GTGCGCCGGC ATCCCGTGCC GCGGGCACCG GGGCAGGTGC GCTCCGGGCT CGCCTACGCG
ATGCGGACGC CGAGCCTGCG TATCCCGCTG CTCATGATGG CCGTGATCGG AGCGTTGTCC
TACGAGTTCC AGGTCGTGCT GCCGCTCGTG GCACGTGAGA CCTTTGGCGG GTCGGCCGCG
ACGTACAGCC TTCTCACCGG CGCGATGGGT GCGGGGGCCG TGGCCGGTGG CCTGGTCGTC
GCCCGGCACC GGCGGGTGGG GGTCCCGGCT CTGGCGGTCA CCTCCGGGGT GTTCGGCGTG
GTCACCCTGG TGGCAGCCGC GGCCCCGGTG CTGGCGCTGG AGGTCGCCGC GCTCGTGGTG
GTCGGCGCGG CGAGCGTCGC GTTCATCTCC ACCGGCAACG CGACCGTGCA GCTCTCCGCG
GCACCGGAGA TGCGTGGCCG GGTGATGGCC CTGTGGTCGG TGGCGTTCCT CGGCTCGACC
CCGGTCGGCG GCCCGATCGC GGGCTGGGTG TCCGAGACGT TCGGTGCGCG GGCCGGCCTG
GCGATGGCCG GCGCGGCGGC GTTGGCCGGA TCCGCCTTCG CGGCGGCGTC CCTGCGCAGC
CGGGCCGCGC GCACCCAGGC GGTCCCGGCG GCTGCGGCCA TCTCCCGGCC ACCGGCCGCG
CCAGCCGGCG ACCCAACACC CGCCACCGCC AGCGACCCGG CGGCCGCCAC GGCGACCGAC
ACCGCGCTCG GCAACCCGGT GGAGGCGGTG CTGAGCGCCG CGAACGCCGC TGCCGCACCG
ACACCGAAAC ACGGACCGGC CCTACAGGAC GTCTGA
 
Protein sequence
MSRSDQVTSR TSAGSAEEAG TRVPSSTGTF AALQVPNFRL FLGGQVVSLC GTWMQMIALG 
WLVLSLGASG TELGLVTAAQ FLPVLLFGAY GGLIADRSNT RRLLITTQII LGSLAVLLGI
LDLTGTARLW MVAAVAAAIG MTSAVDNPAR QSFVQEMVGS EFLPNAVTLN SVTMNAARVV
GPGIAGILIS LVGTSGCFLL NGASFVAVVI ALQRIDTAAL VRRHPVPRAP GQVRSGLAYA
MRTPSLRIPL LMMAVIGALS YEFQVVLPLV ARETFGGSAA TYSLLTGAMG AGAVAGGLVV
ARHRRVGVPA LAVTSGVFGV VTLVAAAAPV LALEVAALVV VGAASVAFIS TGNATVQLSA
APEMRGRVMA LWSVAFLGST PVGGPIAGWV SETFGARAGL AMAGAAALAG SAFAAASLRS
RAARTQAVPA AAAISRPPAA PAGDPTPATA SDPAAATATD TALGNPVEAV LSAANAAAAP
TPKHGPALQD V
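
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the tool used to build this record: it relies on the fact that NCBI translation table 11 assigns amino acids to codons in the same way as the standard code (the two differ in permitted start codons), and the names `CODON_TABLE` and `translate` are hypothetical. Only the first 30 bases of the gene sequence are used as input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for unrecognised codons
        for i in range(0, usable, 3)
    )


# First three 10-base groups of the gene sequence above.
print(translate("ATGAGTCGTT CGGACCAGGT CACGTCGAGG"))  # MSRSDQVTSR
```

The output matches the first ten residues of the protein sequence, and the final codon of the gene sequence (TGA) translates to a stop.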