Gene Franean1_2254 details

Gene Information

Locus tag: Franean1_2254
Symbol:
ID: 5670653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2693231
End bp: 2694754
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 74%
IMG OID: 641241174
Product: EmrB/QacA family drug resistance transporter
Protein accession: YP_001506595
Protein GI: 158314087
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
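
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(2693231, 2694754)
print(length_bp)                  # 1524
print(protein_length(length_bp))  # 507
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of this record.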


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0213589
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCAAG GAGAGTCCAT GTCCCAGCCC TTCCCATCAC CGGCGGCGGC GAGCCCGGCT 
GCTCCCGGCC GGACATCCCC ACCTCCCGAT GTGGGAGCCC CGCGGGCCGC CGGCTCCCGG
CTGCTCGCGC TGTGTGTCCT GTGCGCGTCG ATGCTCATGG TCATCCTGGA CGGCACCATC
GTCACGGTCG CGCTGCCCAC CATTCAGGAT GACCTGGGCT TCTCCGCGTC CGGTCTGGCG
TGGGTCGTCA ACGCCTACCT GGTTGCCTTC GGTGGTCTTC TGCTGCTGGC CGGCCGCATG
GGGGATCTCG TCGGCCGCCG GCGGGTCTTC GTCACCGGAC TGGTGCTGTT CACCGTGGCC
TCCCTGCTCT GCGGGCTGGC GACCGGGGCG GGCACGCTGA TCGCCGCTCG GTTCGTGCAG
GGCGTCGGCG GGGCGCTGTC CTCGTCGGTG GTGCTCGGGA TGATCGTGGT GGCCTTCCCC
GAGCCCCGGG AGCAGGCACG GGCGATCGGG GTGTTCAGCT TCGTCGGCGC CGCCGGGGCG
TCGATCGGCC TGCTCGCCGG CGGCCTGCTC GTCGAGACGC TGACCTGGCA CTGGATCTTC
TTCGTGAACG TGCCGATCGG CGCGGTCGCG GTCGTGCTGT CGCTGCGGGT GCTGCCCACC
GAGCGCGGTC CCGGCCTGCG CGCCGGCGCG GACGCTCCCG GGGCGGTGCT GGTCACCGCG
GGTCTCATGC TCGGCGTCTA CACGATCGTC GGCACCGCCG ACGCGGGCTG GGCCTCCGCC
CGGACCATCG GCCTCGGCGC ACTCGCCCTC GCCCTGCTGG CCGCCTTCGC CGCCCGCCAG
GCGACGGCGG CTCACCCGTT GCTGCCGCCA CGGCTGTTCT CCTCCCGGCC GCTGACCGTC
GCGAACATCG TCCAGACGCT GATGGCCGGC GGGCTGTTCG CCTTCCAGTT CGTGCTCGCG
CTGTTCCTGC AGCGTGTCCT CGGCTACGGG CCGGCCGAGA CGGGCCTGGC GTTCCTGCCC
ATCGCGGCGA CGATCGGCGC GTTCTCGCTC GGGCTCTCGG GACGGCTCGC CCACCGCTTC
GGCGCGGGCC TGGTGCTGCT GCCCGGTCTC GTCCTGCTCG GCCTCGGCCT GTGGCTGCTG
TCCCGCCTGG CCCCGGACGC GGCCTACGCG ACGGACGTGC TGCCGGTCGC GGTCGTCCTG
GGGTGCGGCG GCGGCCTGAC GCTGCCGGCG CTCACCCAGC TCAGCATGAC CGGGGTGCCG
CCGGACGACG CCGGGCTCGC CTCCGGCCTG GCCAACACCA CATTGCAGGT CGGCGGGGCG
CTGGGGCTGG CGGTCCTGAC CACGCTGGCG GCGTGGCGCA CCGGGAACGC GGCGGACGGC
GGCGCCGGGC CGGCGGAGGC CCTCACCGCC GGCTACCACC TCACCTGGAT CGGCGGCGCC
ATCCTGATGG TCGCCGGACT GCTCCTCACC GTCGCCTTCC TCCGGCCGGA CGGGAGCCGG
AAGCAGAGTT CCGCGGACGG TTGA
 
Protein sequence
MPQGESMSQP FPSPAAASPA APGRTSPPPD VGAPRAAGSR LLALCVLCAS MLMVILDGTI 
VTVALPTIQD DLGFSASGLA WVVNAYLVAF GGLLLLAGRM GDLVGRRRVF VTGLVLFTVA
SLLCGLATGA GTLIAARFVQ GVGGALSSSV VLGMIVVAFP EPREQARAIG VFSFVGAAGA
SIGLLAGGLL VETLTWHWIF FVNVPIGAVA VVLSLRVLPT ERGPGLRAGA DAPGAVLVTA
GLMLGVYTIV GTADAGWASA RTIGLGALAL ALLAAFAARQ ATAAHPLLPP RLFSSRPLTV
ANIVQTLMAG GLFAFQFVLA LFLQRVLGYG PAETGLAFLP IAATIGAFSL GLSGRLAHRF
GAGLVLLPGL VLLGLGLWLL SRLAPDAAYA TDVLPVAVVL GCGGGLTLPA LTQLSMTGVP
PDDAGLASGL ANTTLQVGGA LGLAVLTTLA AWRTGNAADG GAGPAEALTA GYHLTWIGGA
ILMVAGLLLT VAFLRPDGSR KQSSADG
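
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is an illustration only, not the pipeline that produced this record. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; the amino-acid assignments are those of the
# standard code, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 24 bases of the gene sequence shown above.
print(translate("ATGCCCCAAG GAGAGTCCAT GTCCCAGCCC"))  # MPQGESMSQP
print(translate("AAGCAGAGTT CCGCGGACGG TTGA"))        # KQSSADG*
```

The two fragments translate to the first ten and last seven residues of the protein sequence, followed by the stop codon. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.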