Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3190 |
Symbol | |
ID | 5671566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3760982 |
End bp | 3763852 |
Gene Length | 2871 bp |
Protein Length | 956 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641242084 |
Product | ABC transporter related |
Protein accession | YP_001507504 |
Protein GI | 158314996 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0411] ABC-type branched-chain amino acid transport systems, ATPase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAATCC CCACCACCCA GCTGCTCTTC GACGGCGCGG TCACCGGCCT GGTGATCGGC CTGCTCGCGG TCGGCATCGT CCTCGTCCAC CGCTCGACCC GCGTCATCAA CTTCGCGGTG GCGAACATGG GCCTGGTCGG CTCGGCCCTC TTCGCACTGC TCACGGTGCG TTACAACGTT CCGTACTGGA TCTCGCTGGC CATCGCGCTG CTCGTAGGTG TGCTGTTCGG CGCCCTGGTC GACCTCACGG TGATCCGCCG GCTGTTCGCC GCGCCACGGG TGATCCTGCT GGTCGCGACC ATCGGCGTCG CCCAGCTCGC GCTGACCGTC GTCACCTCCC TGCCCGACCT CGACGACTAC CCGAGCGAGT CCTACCCGGT GCCCTGGTCG GGGAGCTGGT CACCGGCCAG CGGCCTCACG ATCACCGGGG CGCAGCTGAG CGTCCTGGTC ACGGTGCCTC TCGTCGCGCT CGGGCTCAGC CTGTTCCTGG GCCGCACCGT GCTCGGCAGG ACGGTCAAGG CCGCCGCGGA CAACCCCGAG CTCGCCCGCC TTCAGGGCAT CAGCCCCAAG ACCGTGTCGA CCGCGGTCTG GGCGGTGGCG GGGCTGATCG GAACTCTGTG CCTGATCCTG GTCTCCGCGC AGAACCAGTC CCTGACCCAG ATCACCACCC TCGGCCCCAC CACCCTGCTG CGAGCCCTGG CCGCCGCGGT GATCGCCCGG ATGACGTCGT TCCGCGTCGC GCTGCTCGCC GGGATCGCCC TGGGCCTGGC CCAGTCGTTC ATCCAGTTCA ACTGGCTCGA CCAGCCCGGC CTCACCGACC TGACGATCCT GGTCGTGGTG CTGGTCGCGG TGTTCTTCGT CAGCCGCGGC CGGGACACCG AAACCTCGTC GTTCTCCTTC GCCCCGAAGA TCAAGCCCGT GCCCGACCGG CTGCGCGGGC TCTGGTGGGT CCGCCACCTC GAACGCGCAC CCCTGGTCCT GCTCGGTCTG GTCGCCGTCG TCCTCCCCCT GCTCGTCGCC CAGCCGTCAC GGCACCTGCT CTACACGGTG ATCCTCGGCT ACGCGATCTG CGCCGCCTCG ATCACGATCC TGACCGGCTG GGCCGGGCAG CTCTCCCTCG GCCAGATGGC CTTCGCGGGA TTGGGCGCGC TGCTCGCCGC CGCACTCAAC CGTGGCCTGC GCCTGCAGGT CGGCGGCTCG GTCATGGCGC TGAACGCCCT GCCGTTCCCA GCGGCGGTCG CGATCGCGAC GGCGTTCACC GCGGCCCTGG CGGCCGTGGT CGGAGCGGGC GCGCTGCGGG TGCGCGGGCT GCTCCTGGCT GTCAGCACCT TCGCGTTCGG GATCGCGGCC GAGCAGTACC TCTACCGCCG CGACGTGTTC CACGACGAGG GCGGCAACAC CGCCTCGTTC CCGCGCAGCA CCGTCTTCGG GATCGACGTC ACCACCCAAC GGGCCTACTA CTACCTGGTC CTGATCGTCC TGGTTCTGGT CCTGCTCGTG GTGGCCCGGC TGCGCAGGTC CGGGGTCGGG CGGACCACGA TCGGCGTCCG CGACAATCCC GCCACCGCGG CCGCGTACAC CGTCGCCCCC ACCCGGGTGA AACTGCGCGC GTTCGCTCTC GCCGGCGGGA TCGCCGGCCT CGGCGGCAGC CTGCTCGCCG GCGCCGTCCA GAACGTCCCC TACGCCGACA GGTACTTCCT CAGCCCCGAC TCGCTGATCC TGGTCTCCAT CGTGGTGATC GGCGGCCTCG GCTCGGTCAC CGGCCCCCTG CTCGGCTCCC TGTGGGTCAT CGGCCTGCCG TCCTTCTTCC CCGACAACGA CATCGTCCCG CTCCTGACCT CCAGCCTGGG CCTGCTGATC CTATTGCTCT ACTTCCCCGG CGGCCTGGTC CAGATCGGGT ACAGCGCCCG CGACGCCCTC CTCGCCTGGG CGGACAGAAG GCTCGGCACC ACTCCAGCGG TGAAGAGCGT CTCGACCGTG CCCCCGGCTC TCACCCGCAC CACCCGCCCG CCGCTCCCGG AGAACACCCC CGTCCTCGAG ACCAGCGACA TCCGGGTGCG ATTCGGTGGC CGGGCCGCCG TCGACGGCGT CTCCCTCACC GTCATGCCCG GCGAGATCGT CGGCCTCATC GGCACCAACG GCGCCGGCAA GTCAACCCTC ATGAACGCCA TCGGCGGCTT CGTCCCCGCC ACCGGCACCG TCACCCTGCT CGGCCGGGAC GTCTCCAACG CCAGCCCGGC GGCCCGGGCC CGCGGCGGTG TCGGCCGAAC CTTTCAGGCC GCCGCGCTGT TCCCCGAGCT CACCGTCCGC GAAACCGTCC AGATAGCGCT GGAGGCCCGC GGCCGCACCG GGCTTCCCTC CACCGCCCTG CACCTGCCGC ACACCTTCCG CGCCGAACGC GCGAAACGCT CTGCCGCCGA CGACCTCATC GACTTCCTCG GCCTCGGCCG CTACGCCGAC GCCTTCATCG CCGACCTGTC CACCGGCACC CGCCGCATCG TCGAGCTCAC CGGCCTGCTG GCCCTCGACG CCCAGGTCCT CTGCCTCGAC GAACCCACCG CCGGCGTCGC CCAACGCGAA ACCGAGGCGT TCGGCCCGCT CATCCAGGAA ATCCGCCGCG AGCTCGGCGC CGCCATGCTC GTCATCGAAC ACGACATGCC ACTGATCATG AGCATCAGCG ACCGCGTCTA CTGTCTCGAA ACCGGCCAGA TCATCGCCAG CGGAACGCCG GACGCCGTCC GCAACGACCC CAGGGTCATC GCCAGCTATC TCGGCACCGA CGAGCGCGCC ATCGAACGCA GTGGAAAAGC CCCGGCGGCC CCGGACCCCG AGGACCACCA CACGCCGACC GACGCGGTCG TGCCCTCGTA A
|
Protein sequence | MEIPTTQLLF DGAVTGLVIG LLAVGIVLVH RSTRVINFAV ANMGLVGSAL FALLTVRYNV PYWISLAIAL LVGVLFGALV DLTVIRRLFA APRVILLVAT IGVAQLALTV VTSLPDLDDY PSESYPVPWS GSWSPASGLT ITGAQLSVLV TVPLVALGLS LFLGRTVLGR TVKAAADNPE LARLQGISPK TVSTAVWAVA GLIGTLCLIL VSAQNQSLTQ ITTLGPTTLL RALAAAVIAR MTSFRVALLA GIALGLAQSF IQFNWLDQPG LTDLTILVVV LVAVFFVSRG RDTETSSFSF APKIKPVPDR LRGLWWVRHL ERAPLVLLGL VAVVLPLLVA QPSRHLLYTV ILGYAICAAS ITILTGWAGQ LSLGQMAFAG LGALLAAALN RGLRLQVGGS VMALNALPFP AAVAIATAFT AALAAVVGAG ALRVRGLLLA VSTFAFGIAA EQYLYRRDVF HDEGGNTASF PRSTVFGIDV TTQRAYYYLV LIVLVLVLLV VARLRRSGVG RTTIGVRDNP ATAAAYTVAP TRVKLRAFAL AGGIAGLGGS LLAGAVQNVP YADRYFLSPD SLILVSIVVI GGLGSVTGPL LGSLWVIGLP SFFPDNDIVP LLTSSLGLLI LLLYFPGGLV QIGYSARDAL LAWADRRLGT TPAVKSVSTV PPALTRTTRP PLPENTPVLE TSDIRVRFGG RAAVDGVSLT VMPGEIVGLI GTNGAGKSTL MNAIGGFVPA TGTVTLLGRD VSNASPAARA RGGVGRTFQA AALFPELTVR ETVQIALEAR GRTGLPSTAL HLPHTFRAER AKRSAADDLI DFLGLGRYAD AFIADLSTGT RRIVELTGLL ALDAQVLCLD EPTAGVAQRE TEAFGPLIQE IRRELGAAML VIEHDMPLIM SISDRVYCLE TGQIIASGTP DAVRNDPRVI ASYLGTDERA IERSGKAPAA PDPEDHHTPT DAVVPS
|
| |