Gene Franean1_0437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0437 
Symbol 
ID5668860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp518334 
End bp519530 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content72% 
IMG OID641239369 
ProductBcr/CflA subfamily drug resistance transporter 
Protein accessionYP_001504808 
Protein GI158312300 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00710] drug resistance transporter, Bcr/CflA subfamily 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATTCCG CACGCCGCAC CCTGCTCGTG CTCGGCTTCC TCGTCGCGCT CGGGCCGTTC 
ACGGTCGACT TCTACATTCC GGCGTTCCCG CTCGTCCAAG CCGACTTCGG CACGAGCGCA
GCCGCCGTGC AGCTCACGCT GACGGCCACC ACCATCGGCT TCGCGCTGGG CCAGCTGGCC
ATCGGCCCGT GGAGCGACAG CATCGGGCGG CGCCGGCCCC TGCTGGTGGC CACCGCCCTG
CACGTGGCCG GCAGCCTGGG CGTCGCCGCG GCGCCCACCG TCGAAGTCAT GCTCGTCTTC
CGGCTGCTGC AGGGCGCCGG AGCCGCCGGC AGCGGCGTCG TGGCCCTGGC CATGGTGCGC
GACCTCTTCG ACGGTGCCCT GTTCGTGCGG ATGGCAGCCC GGCTCGCCGT GGTGACCGGG
CTCGCGCCGG TTGTCGCACC TTTCGCCGGT TCGCTGATGC TGAGCCACAT GTCCTGGCGA
GGCTTGTTCG TCTGCATCGC GCTCTACGGC TTGGCGGTGC TCGCCGTCGC GGCGTTCCTG
GTCCGGGAGA CGGCCCCGTT GGTGCGGCGA GCGGGGGCGC CGCTCGGGCG TTACCGCGTG
CTGGTGACGG ATCGCGGCTT CGTCGGCGCC GCCCTCGCCG GTGGTCTGCT GGTCTCCAGC
GTCTTCACCT ACATGAGTTC GTCGTCGTTC CTCTTCCAGG AGACGTACGG CCTCTCCGCC
CAGCGGTACA GCCTGGTCTT CGCGGCGAAC GCGGTGGGCT TCGTGATCGG GGCGCAGACG
TCAGCGCGGC TCGTCACCCG GATCGGGCCG CGGCGGCTGC TCCGATACGT ACTGCCCTCG
CTCGGCTTCC TCGGCTTCAC CCTGCTGCTC GCGGCATTCG CCGGCGAGAA CGTGGTGGTG
GTGACCTTCG TGACGGCGCT CTACTTCCTC CTGGCAGGCG CCGTCGGGCC CTGCCTGCAG
GTGATCGGCA TGGCCCCGCA CGGGGAGAGG GCCGGAACCG CCGCCGCGCT GATGGGCGCC
GCGAACTTCG GCCTGGCCGG CGCGACCGCG CCGGTGGCCG GACTACTCGG CGTCGGCTCG
ATCGGCCCGA TCGGCCTCGT CATGGGGCTG ACCATGACGG TCGCGGTGGT TGTCTTCCGG
GTGTTGGCCC GCGACCGGCG CGAAGCGCGG GCCGGGGCGC CTTCGCCGGT CCCTTGA
 
Protein sequence
MNSARRTLLV LGFLVALGPF TVDFYIPAFP LVQADFGTSA AAVQLTLTAT TIGFALGQLA 
IGPWSDSIGR RRPLLVATAL HVAGSLGVAA APTVEVMLVF RLLQGAGAAG SGVVALAMVR
DLFDGALFVR MAARLAVVTG LAPVVAPFAG SLMLSHMSWR GLFVCIALYG LAVLAVAAFL
VRETAPLVRR AGAPLGRYRV LVTDRGFVGA ALAGGLLVSS VFTYMSSSSF LFQETYGLSA
QRYSLVFAAN AVGFVIGAQT SARLVTRIGP RRLLRYVLPS LGFLGFTLLL AAFAGENVVV
VTFVTALYFL LAGAVGPCLQ VIGMAPHGER AGTAAALMGA ANFGLAGATA PVAGLLGVGS
IGPIGLVMGL TMTVAVVVFR VLARDRREAR AGAPSPVP