Gene Franean1_3645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3645 
Symbol 
ID5672012 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4318859 
End bp4321204 
Gene Length2346 bp 
Protein Length781 aa 
Translation table11 
GC content67% 
IMG OID641242529 
ProductMMPL domain-containing protein 
Protein accessionYP_001507949 
Protein GI158315441 
COG category[R] General function prediction only 
COG ID[COG2409] Predicted drug exporters of the RND superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.197426 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGGA TCGCTGAACT CGCTGTCCGT CGCCGATGGT TCGTCGTCGT CGGCTGGGTT 
GTCTTCATCG TCGCGGTGCA GGGGATCGCC GGGGCGATGG GCGGGGCTTC GTACAAGGAC
ACGTTCAGCC TCCCGCACAC CGAGACCGCG TCCGTCGCGA AGCTCCTCGA GGATGCCGGC
CTGAACAATC AGAACGGCGC CCCGGGCACG GTTGTGATCA AGAACGAGAG CGGAACGCTC
ACCGAGCCGC CACCGAAGCT GAAACCGGCC CTGGCCGAGG TGTGCGCTTC GGGTAACCAT
GTGGCACTGA TCGCGTCGCC CTGGGAGTCG ATCGACTGCT CGAAGAGCGA TGCCGAAGCG
CCGGGAAACC CACAGCTGCT CAACAGTGCG CGCGGCTCCA CCACGGCCCT GGTCACCATC
ACCTGGGAGA ACGACCACTA CGACGCCGAG CTGTTCAAGA ACGTCTACGA TCAGCTGAAG
ACGCTGCGCA GCGATTCGCT GCAGGTCGAG TTCACCGGTA ACGCCTTCAC CGGCATCGGG
CAGAGCGACG GCTCGGGCTC GTCGGTGTTC ATCGGATTCG CGGCTGCCCT CATCATCCTG
GCGCTGGTGT TCCGTACCGT GGCCGCCACG GTGCTGCCGC TGGCCAGCGC GGTGGCCGCG
CTCGTCAGCG GCCTCGGCGT GATCTACATC CTCAGCCACG CCATCAACGT CTCCAACATC
ACCCCGTACC TGGCCGAGCT GATGGTGATC GGCGTCGGCG TCGACTACGC GCTGTTCATC
CTCACGCGGC ATCGCCGCAA CCTGCGGCGC GGCATGCCCG TCGCGGATTC GATCGTGAAC
GCGCTCAACA CCTCCGGCCG GGCGGTGCTG TTCGCCGGTA CGACCGTGTG CATCGCCATC
CTCGGTCTGA TCGCACTCGG GGTGAGCTTC TTCAACGGCA TGGCGGTGGC GACCGCGCTC
GCGGTCGGCT TCACCATGAT TGCCTCGCTG ACGCTGCTGC CCGCATTGCT GGCCATCTTC
GGCCTGAAGG TGCTGCCCCG CCGGCAGCGG GCGGCGGTGC GGGCCGGTGA GTTCATCGAT
GACCGTCCGG TGGGGGGCTG GGCCCGGTGG TCGCGGTTCG TCGCTGGGCG CCCCGTCGTC
GTCGCGATCG TCTCGGGCGC GATCATGGTC GCGATCGCGC TGCCGTTCTT CTCGATGGAG
CTGGGCGCCA GCGATCAGGG CAGCGACCCG AAGAGCTCCA CGACCCGCGA CGGTTACGAC
CTGATCGCCT CCGATTTCGG CGTCGGCTAC AACTCCACTC TGGAAGCCGT TGTGAGCGGC
CCGGGCGCCT CGGACCAGGC CTACCTGCAG CGCGTGACAA AGACGCTGTC CGCTGTCCCG
GGCATCGACC CGGGCAGCCT GGGCACGGTT CCGCTCGCTG AGAACGTCGC CTTCGTGACG
TTCAAGACGA CCACGTCACC GCAGTCGGAG AAGACCTACG AGCTGGTCCG GCACCTGCGC
TCGACCACCC TGCCGCCGCT GTACGACGGC ACGGCCAACC ACATCTACAC CTACGGTGAC
ACGGCGATCA ACGTCGACTT CGCCGCGGTG CTTGCCCGGA AGATGCCGCT GTTCATCGCG
GTCGTGGTCG GCCTGTCGTT CGTCCTGCTG CTCGTCGCGT TCCGGAGCCT GGTCATCCCG
CTGACCGCCG CGGTGATGAA CCTGCTGGCA GCGGGCGGTT CGTTCGGTCT GGTTGTGGCG
ATCTTCCAGT ACGGCTGGCT CTCCGACAGC ATGGGCGCCG GACCAGGCGG ACCGATCGAC
GCCTGGATCC CGGTCATGCT GTTCGCCATC CTGTTCGGCC TGTCGATGGA CTACCAGGTG
TTCCTGGTCA GTCGCATGCA TGAGGAATGG GTACACACCC GCGACAACAA GCGATCGGTG
ACCATCGGGC AGGGCGAGAC CGGCGGCATC ATCACCGCCG CCGCCATCAT CATGATTGCC
GTCTTCCTCG GCTTCGTGGT CAGCCCGGGC CGGCCGATCA AGATCTTCGG TACCGGCCTC
GCCGCCGCCG TGTTCATCGA CGCGTTCGTT CTCAGGACAA TGCTCGTACC GTCGCTGATG
CACATTGTCG GCAAGGCGAA CTGGTACCTT CCGAAATGGC TGGACCGCAT CACTCCGCGA
GTCTCGGTCG AACCAGCCGA CGAGGCCGTC CCCCACAGCG TGGGCACCGG CTCCTTCGAC
ACCGACCGGC CTGAAGGCGA CGACGACCGG CCTGAAGGCG ACACCGACCG GCCCGAAGAC
GAGGTCGACC GGCCCGAAGA CGACGACGAC CGGCCCGAAG ACGAGCGGGA GCTGGCCCGC
TCCTGA
 
Protein sequence
MKRIAELAVR RRWFVVVGWV VFIVAVQGIA GAMGGASYKD TFSLPHTETA SVAKLLEDAG 
LNNQNGAPGT VVIKNESGTL TEPPPKLKPA LAEVCASGNH VALIASPWES IDCSKSDAEA
PGNPQLLNSA RGSTTALVTI TWENDHYDAE LFKNVYDQLK TLRSDSLQVE FTGNAFTGIG
QSDGSGSSVF IGFAAALIIL ALVFRTVAAT VLPLASAVAA LVSGLGVIYI LSHAINVSNI
TPYLAELMVI GVGVDYALFI LTRHRRNLRR GMPVADSIVN ALNTSGRAVL FAGTTVCIAI
LGLIALGVSF FNGMAVATAL AVGFTMIASL TLLPALLAIF GLKVLPRRQR AAVRAGEFID
DRPVGGWARW SRFVAGRPVV VAIVSGAIMV AIALPFFSME LGASDQGSDP KSSTTRDGYD
LIASDFGVGY NSTLEAVVSG PGASDQAYLQ RVTKTLSAVP GIDPGSLGTV PLAENVAFVT
FKTTTSPQSE KTYELVRHLR STTLPPLYDG TANHIYTYGD TAINVDFAAV LARKMPLFIA
VVVGLSFVLL LVAFRSLVIP LTAAVMNLLA AGGSFGLVVA IFQYGWLSDS MGAGPGGPID
AWIPVMLFAI LFGLSMDYQV FLVSRMHEEW VHTRDNKRSV TIGQGETGGI ITAAAIIMIA
VFLGFVVSPG RPIKIFGTGL AAAVFIDAFV LRTMLVPSLM HIVGKANWYL PKWLDRITPR
VSVEPADEAV PHSVGTGSFD TDRPEGDDDR PEGDTDRPED EVDRPEDDDD RPEDERELAR
S