Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3152 |
Symbol | |
ID | 5671529 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3707966 |
End bp | 3710797 |
Gene Length | 2832 bp |
Protein Length | 943 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641242047 |
Product | ABC transporter related |
Protein accession | YP_001507467 |
Protein GI | 158314959 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0411] ABC-type branched-chain amino acid transport systems, ATPase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACTGC CTCCTACGCA GCTGCTGTTC GACGGTGCGG TGAGCGGTCT GGTGATAGGA CTGCTGGCGG CCGGGATCGT GCTGGTGCAC CGGGCGACAC GTGTCATCAA CTTCGCTGTC GCGAACATGG GCCTGGTCGG CTCCGCGCTA CTCGCGCTGC TGGTGGTCCG CTACAACGTC CCCTACTGGA TCGCGCTCGT CTTGGTCCTC GCTGTGGGCG CGCTGTTCGG AGCGATCATC GACCTGGCGG TGGTCCGCCG GCTGTTCACC GCACCACGGG TGATCGTGCT GGTCGCCACG ACCGGCGTCG CGCAGCTCGC CCTGACGATC GTGACGGCTT ACCCGAAACT CGACGACTAC CCGACCAGCT CCTACCCGCT GCCCTGGACG GGCACCTGGA CGCCGTTCGA CGACCTGCAG ATCACCGCCG CGCAGCTCAG CATCCTGGTC ACGGTGCCCG TCGTCACACT CCTGCTCGGA CTGTTCCTCA GCCGCACCGT GCTGGGCCGG ACGGTACAGG CCTGCGCGGA CAACCCCGAA CTCGCCCGGC TCCAGGGCAT CAGCCCGAAA ACCATGTCCA CCGTGGTGTG GGCAGTCGCC GGCCTGCTCG CCACACTCTG TCTCATCCTC ATCTCCGCGC AGAACCGCTC GCTGACGCAG ATCACGACGC TCGGCCCGAC CACGCTGCTG CGGGCGCTGG CCGCGGCGGC CATCGCCCGG ATGATCTCGT TCCGGATCGC CCTGCTCGCT GGTGTCGTCC TCGGGCTGCT GCAGTCCTTC GTCCAGTTCA ACTGGCTCGA CCAGGCCGGC CTGACCGACA TGGTCATCCT GATCGTGGTT CTTGCGGCAG TTTTCCTCGC CAGCAGAAGC CACGACACCG AAACCTCGTC GTTCTCCTTC GCCCCCAAGG TCCGCCCGAT CCCCGAGCGG CTGCGCAGGC TGTGGTGGAT TCGCAACCTC GAGCGGGCAC CCCTGCTCCT CCTCGGCCTG ATCGCCGTCA TCCTGCCGCT GGTGATAGGT CAGCCGTCAC GCCAACTGCT CTACACCGTC ATCCTCGGCT ATGCGATCTG CGGCGCCTCC GTCACGATCC TCACCGGCTG GGCTGGGCAG CTCTCCCTCG GTCAGATGGC CTTCGCCGGC CTCGGAGCCC TTATCGCCGC GACCCTCAAC CGCGGACTGC ACCTGACGGT CGGCGGTACC ACGATCAGTG CGAGTCCCCA ACCCTTCGCC GTCGCCGTCG CACTGGCCAT GCTGGTCACC GCCGTTCTGG CGACCCTGGT CGGTATCGGC GCGCTACGGG TACGCGGCCT CCTGCTGGCG GTCAGCACCT TCGCGTTCGG CATCGCGGCC GAGCAGTACC TCTACCGCCG CGAATTCCTG CACGATGAGG GCACCACTAC CGCGTCGTTT CCCCGCGCCA GCGTATTTGG AATTGACATC GGCACTCAGC GCGCCTATTA CTATCTGGTC CTGGTGGTAC TGGTCATCGT CATGGTCGTC ATCGCGCGTC TACGCCGCTC GGGCGTGGGA CGGACCACGA TCGGGGTGCG AGACAATTCC GTCGCCGCCG CGGCCTACAC GGTCGGTCCC GTCAGGGTGA AGCTCCGCTC GTTCGCGCTC GCGGGCGGAC TGGCGGGACT TGGCGGAGCG TTGCTGGCAG GTGCAGTGCA GGAGGTGCCC TACACCGACC GGTACTTCCT CAGCACGGAC TCACTGGTGC TGGTGTCCGT CGTCATCATC GGCGGCGTGG GATCAGTGAT TGGCCCTGTG CTCGGCTCAC TGTGGGTGGT CGGCTTACCA GCCATCTTTC CCGACAACAC CATCGTACCG TTGCTAACCT CCAGCCTGGG CCTGCTCCTC CTGCTGCTGT ACTTTCCCGG CGGACTGGTC CAGATCGGCT ACAGCGCCCG CGATGCCCTC CTCGCCTGGG CCGACCGGCG CCTCGGCCCC GTCACAGCAC CGGTGAAAGT AGCCACGGCC AGGCCGGTCG CGCTCACACG TACTCCCAGC ACGCCGCTTC CGGCAGGCAA GCCAGTGCTC GCCACCACCG ACGCGCGGGT ACGTTTCGGT GGACGGGTCG CGGTCGACGG CGTCTCCGTC CAGATCATGC CCGGCGAGAT CGTCGGGCTG ATCGGTACCA ACGGCGCCGG CAAATCCACC CTGATGAACG TCGTCGGCGG GTTCGTGCCC GCCACCGGGA CCGTGACGTT GCTGGGCCAA GACGTCTCCC GTACCAGCCC CGAGAAGCGT GCGAGACTCG GGCTCGGCCG CACATTCCAG GCAGCATCCC TGTTCCCGGA GCTCACCGTT CGGGAGACCG TACAGATCGC GCTGGAGGCC CGCGGTCGCA CCACGTTGCT GTCCACCGCC CTGCACCTAC CGCACACCTT CGCCCGCGAA CGCGCCAAGC GTTCCGAGGC AGAGGACCTG ATCGACTTCC TCGGCCTCGG CCTCTACGCC GACGCCTTCA TCGCCGACCT GTCCACCGGC ACCCGTCGCA TCGTCGAACT CGCGGGCCTC CTCGCCCTCG ACGCCCGCGT GCTCTGCCTC GACGAACCCA CCGCCGGCGT CGCCCAGCGC GAGACCGAGG CGTTCGCTCC GCTCATCCAG GAGATCCGCC GCGAACTCGG CGCGGCCATG CTCGTCATCG AACACGACAT GCCACTGATC ATGAGCATCA GCGACCGCGT CTACTGCCTC GAGACAGGCA GGGTCATCGC CGCCGGTCCT CCTGGCACCG TCCGCAACGA CCCCAAAGTC ATCGCCAGCT ACCTCGGCAC CGACGAACGC GCCATCGAAC GCAGCGGAGT CTCCACCACC GCGACGGCGT AG
|
Protein sequence | MELPPTQLLF DGAVSGLVIG LLAAGIVLVH RATRVINFAV ANMGLVGSAL LALLVVRYNV PYWIALVLVL AVGALFGAII DLAVVRRLFT APRVIVLVAT TGVAQLALTI VTAYPKLDDY PTSSYPLPWT GTWTPFDDLQ ITAAQLSILV TVPVVTLLLG LFLSRTVLGR TVQACADNPE LARLQGISPK TMSTVVWAVA GLLATLCLIL ISAQNRSLTQ ITTLGPTTLL RALAAAAIAR MISFRIALLA GVVLGLLQSF VQFNWLDQAG LTDMVILIVV LAAVFLASRS HDTETSSFSF APKVRPIPER LRRLWWIRNL ERAPLLLLGL IAVILPLVIG QPSRQLLYTV ILGYAICGAS VTILTGWAGQ LSLGQMAFAG LGALIAATLN RGLHLTVGGT TISASPQPFA VAVALAMLVT AVLATLVGIG ALRVRGLLLA VSTFAFGIAA EQYLYRREFL HDEGTTTASF PRASVFGIDI GTQRAYYYLV LVVLVIVMVV IARLRRSGVG RTTIGVRDNS VAAAAYTVGP VRVKLRSFAL AGGLAGLGGA LLAGAVQEVP YTDRYFLSTD SLVLVSVVII GGVGSVIGPV LGSLWVVGLP AIFPDNTIVP LLTSSLGLLL LLLYFPGGLV QIGYSARDAL LAWADRRLGP VTAPVKVATA RPVALTRTPS TPLPAGKPVL ATTDARVRFG GRVAVDGVSV QIMPGEIVGL IGTNGAGKST LMNVVGGFVP ATGTVTLLGQ DVSRTSPEKR ARLGLGRTFQ AASLFPELTV RETVQIALEA RGRTTLLSTA LHLPHTFARE RAKRSEAEDL IDFLGLGLYA DAFIADLSTG TRRIVELAGL LALDARVLCL DEPTAGVAQR ETEAFAPLIQ EIRRELGAAM LVIEHDMPLI MSISDRVYCL ETGRVIAAGP PGTVRNDPKV IASYLGTDER AIERSGVSTT ATA
|
| |