Gene Franean1_3651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3651 
Symbol 
ID5672018 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4327383 
End bp4328936 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content72% 
IMG OID641242535 
Productmajor facilitator transporter 
Protein accessionYP_001507955 
Protein GI158315447 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTCCA TCGAGAACAC CGGCCACCGC GGCTCGGCCA CCGGCCACCG CGCGGACGGC 
GCACTGCCCG TCAACGCGAT CATCGCCGTC CTGGGCGCCG TGGGCATCGT CGTCGCGATG
ATGCAGACCC TGATGGTGCC GCTGATCCCG GTGCTGCCGT CGCTGCTGCA CACGCACGCG
GCGGACGCAT CGTGGGCCAT CACGGCGACC CTGCTCGCCG CGTCGGTCGC GAACCCGGTG
TTCGGGCGGC TCGGCGACCT CTTCGGCAAG CGGCGCATGC TGCTCGTCTC CGGGTTCGTG
CTGGGCTGCG GCTCCCTGGT CTGTGCCCTG AGCGACACTC TGGTGCCGAT GGTCGTCGGC
CGGGCGATGC AGGGCCTCGG GCTGGCGATC ATCCCGCTGG GCATCAGCAT CATGCGTGAC
CTGCTGCCGG CGAAGCGGCT GATCCCCGCC ATGGCGCTGA TGAGCTCCTC GCTCGGGATC
GGTGGCGCCC TGGGCCTGCC GCTCGCCGCC GCGGTCGCCC AGCAGACGAA CTGGCACGTG
CTGTTCTGGG GCTCGACCGT GGCGGTCGCC CTGCTGATGG TCCTGGTGTG GCGGGTCGTT
CCCGAGTCGC CGGTGCGCGG CACGGGCCGG TTCGACCTGC CGGGGGCGAT CCTGCTCTCC
GGAGGGCTCG TCGCGCTGCT GCTCGCCGTG TCGAAGGGAA GCACCTGGGG CTGGACCAGC
ACCACCACCC TCGGCCTGGG CGGAACGGCG GCCGCGCTCC TCGTCGCCTG GGCCTGGTGG
GAGACCCGCG TCGAGGCGCC CCTCGTGGAC CTGCGCACCA CCATCCGGCG CCCGGTGCTG
CTGACGAACC TCGCCTCGGT CATGCTCGGC TTCGCGATGT ACACGATGTC GCTGATCGGC
CCGCAGCTGC TGCAGCTGCC GAAGGCCACC GGGCACGGGC TCGGCCAGTC GCTGCTCGCC
ACCGGCCTGT GGATGGCCCC CGCCGGAATC GTCATGATGG CCGTCTCGCC CATCGCCGGC
CGGCTGATCA CGGTGCGCGG CCCGAAGGTG GCGCTCCTCG CCGGCTCGGC CGTGATCTCG
GCCGGCTACT TCCTCGCCAT CGGTCTCACC GGCAGCCCGC TGGGAGTCCT GCTCGTCAGC
GTCGTGACCA GCGCCGGTGT CGGCCTGGCC TACGCCGCCA TGCCCACCCT GATCATGCAG
GCCGTCCCGG CCTCGGAGGG CGCCTCGGCG AACGGTCTCA ACACCCTCAT GCGCTCCATC
GGGACGTCCA CCGCGAGCGC GGTGATCGGT GTGGTCCTGG CGAACATGAC CGTCACCTTC
GGCTCGACCC AGGTGCCGTC CCTCGCCGGT ATGCACATCG GATTCCTGAT CGGCGCCAGC
GCCGCCCTCG TCGCCTGCCT GGTCGCCATC GCCATTCCCG GCCGCCTGGC CCCGACACCG
GACGTCGCCT CGAAGCCCGA CGTGGCCCTG AAGCCGGACG TCGTTCTCCC GGCGCCCCGC
CCGTCGTCGC CGCGGACGGC CGAGCCGGAG ACGGCAGCCG TCTCCGCCCG CTGA
 
Protein sequence
MASIENTGHR GSATGHRADG ALPVNAIIAV LGAVGIVVAM MQTLMVPLIP VLPSLLHTHA 
ADASWAITAT LLAASVANPV FGRLGDLFGK RRMLLVSGFV LGCGSLVCAL SDTLVPMVVG
RAMQGLGLAI IPLGISIMRD LLPAKRLIPA MALMSSSLGI GGALGLPLAA AVAQQTNWHV
LFWGSTVAVA LLMVLVWRVV PESPVRGTGR FDLPGAILLS GGLVALLLAV SKGSTWGWTS
TTTLGLGGTA AALLVAWAWW ETRVEAPLVD LRTTIRRPVL LTNLASVMLG FAMYTMSLIG
PQLLQLPKAT GHGLGQSLLA TGLWMAPAGI VMMAVSPIAG RLITVRGPKV ALLAGSAVIS
AGYFLAIGLT GSPLGVLLVS VVTSAGVGLA YAAMPTLIMQ AVPASEGASA NGLNTLMRSI
GTSTASAVIG VVLANMTVTF GSTQVPSLAG MHIGFLIGAS AALVACLVAI AIPGRLAPTP
DVASKPDVAL KPDVVLPAPR PSSPRTAEPE TAAVSAR