Gene Franean1_2993 details

Gene Information

Locus tag: Franean1_2993
Symbol:
ID: 5675706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3519023
End bp: 3520846
Gene Length: 1824 bp
Protein Length: 607 aa
Translation table: 11
GC content: 66%
IMG OID: 641241896
Product: EmrB/QacA family drug resistance transporter
Protein accession: YP_001507316
Protein GI: 158314808
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.999579
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGCACA CCAACGTGCA GCCGTCTAAC TTGCACAGTA CTGAGCAAGT AGGCGCTCGA 
AGTGCCATGA CACGGGAGGG CATGACAGTG CACCTGATCG AACGTAGGCC GATCGAACCG
ACCGATACCC CGTACCCGCG GCGCTGGGCC GGACTCGCGG TCCTGTGCCT GGCCCTTCTG
ATCATCGTGA TGGCCAACAC CTCGCTGATC GTGGCCATCC CCGACATGGT CCGGGACCTC
TCACTGACCA GCGCGGATCA GCAGTGGACG ATTGACGCCT ACACCGTTCC GTACGCGGCA
CTGATGCTGG TGCTCGGCGC GCTCGGCGAC CGGTACAGCC GGCGCGGCGG TCTCATCGTC
GGCCTTCTTC TGTTCGGTGC CGGTTCGATC GTGGGAGCGC TGGCCGACAG CACGGCAGCG
GTGGTCATCG CGCGGGCCAT CATGGGTGTC GGCGCGGCCA CTATCATGCC TGCGACGCTG
TCACTACTGG TCGCAACATT TCCGAAACGG GAACGTACGC TCGCCATCAC CATATGGACC
ACCACCTCAG GCCTTGCCAT CGCCCTCGGT CCGCTCCTTG CGGGACAGCT CTTGGAAAGC
TATCCCTGGA ACTCCACGTT TCTGATCAAT CTCCCGATCG CGGCGATCAC GATCGGTGCA
ACGATGGTCT GCATCCCGCC CTCGAAGGCC AACGGCCCAG GACGCTCCGA TATCGTCGGT
GGGCTGCTGT CGATCGCCAC GATCGCTGCA CTCATCTACG CGATCATCGA GGGACCGCAC
TTCGGGTGGG ACACCTACCC GATCATCGCG GCGGTCGTCG CGGCCGTTGG CCTGGTCGCA
TTCGTCCTGT GGGAGCTGCG GACCCGGACT CCGATCCTGG ATGTCCGCCT GTTCCGGATC
CGGGCCTTCT CCGGATCTAC GCTCGCAGTG TTGCTGTTCT TCCTCGGCAC CTTCGGCGCG
ATCTACTACA TCTCGCAGTA CCTGCAGTTC GTACTCGGCC TCGGCCCGTT GGACACCGGT
ATCCGGCTGC TTCCGCTAGC CGGCGCGGTC TTCGTCGGCG CGGCACTGAC TGGACGGATG
ACCCCACGAC TCGGCGCGAA GGTGACTGTC CCCGCCGGAA TGGCGATCGG CGCGGCCGGC
ATCCTGCTGA TGACCGGCCT CGACGACGGC TCCGGATACA CCGACTTCCT CGCGCCGCTG
ACCATGCTCG GCCTAGCGAT CGGTCTGAGC GTCGCGCCGT GCACCGATGC CATCATGGGC
GTCTTCCCCG AGGACGAGCT GGGCGTCGGC GGCGCCGCGA ACGACACCGC AGTGGAGCTC
GGCGGGTCAC TGGGCATCGC CATCCTCGGG TCGATCCTCG CCACCTCCTA CAAGACCGAC
CTCACCGGCA CGGTCGGCAG CCAGCTACCG CATGACCTGC ACGAACCTGT CCTGGACTCG
GTCGGCGGCG CGATCAAGGT TGTCCAGGGC CTGGCCGAGC AGGGCGTGCC GCAGGCCGGA
CCACTCGCCG ACGCCGCCCG CCACTCCTTC ATCGAAGCCA TCACCCAGAC CAGCCTGGTC
GGCGCCATCA TCCTCACCGT GGGAACGGTG CTGGTGGGCC TCATTCTCCC CCGCCACAGC
ACACCGACGG CGCCGGCCGA ACCAACGGAA GCAAACGCAG TCGGCAAACC CGAACCGGCT
GGTAGCCCTG CCGGCCCCGG TCGGCGTCAC GTTGGAACTC CGCCAGCACC CGATGGCTCC
GATCGACCCA GCCGAAGCTC GCGGCACGGA CGGGAAATCT TCGCGCACTC ATCCGCCAGC
CCGAGGCATC CGAGAACCCG GTAA
 
Protein sequence
MLHTNVQPSN LHSTEQVGAR SAMTREGMTV HLIERRPIEP TDTPYPRRWA GLAVLCLALL 
IIVMANTSLI VAIPDMVRDL SLTSADQQWT IDAYTVPYAA LMLVLGALGD RYSRRGGLIV
GLLLFGAGSI VGALADSTAA VVIARAIMGV GAATIMPATL SLLVATFPKR ERTLAITIWT
TTSGLAIALG PLLAGQLLES YPWNSTFLIN LPIAAITIGA TMVCIPPSKA NGPGRSDIVG
GLLSIATIAA LIYAIIEGPH FGWDTYPIIA AVVAAVGLVA FVLWELRTRT PILDVRLFRI
RAFSGSTLAV LLFFLGTFGA IYYISQYLQF VLGLGPLDTG IRLLPLAGAV FVGAALTGRM
TPRLGAKVTV PAGMAIGAAG ILLMTGLDDG SGYTDFLAPL TMLGLAIGLS VAPCTDAIMG
VFPEDELGVG GAANDTAVEL GGSLGIAILG SILATSYKTD LTGTVGSQLP HDLHEPVLDS
VGGAIKVVQG LAEQGVPQAG PLADAARHSF IEAITQTSLV GAIILTVGTV LVGLILPRHS
TPTAPAEPTE ANAVGKPEPA GSPAGPGRRH VGTPPAPDGS DRPSRSSRHG REIFAHSSAS
PRHPRTR