Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3248 |
Symbol | |
ID | 5671622 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3838494 |
End bp | 3841373 |
Gene Length | 2880 bp |
Protein Length | 959 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641242140 |
Product | ABC transporter related |
Protein accession | YP_001507560 |
Protein GI | 158315052 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0411] ABC-type branched-chain amino acid transport systems, ATPase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCTCC CCACCGCGCA ACTCCTGTTC GACGGGGCGA CCACCGGACT GGTCATCGGC CTGCTCGCCG TCGGCATCGT CCTGGTCAAC CGGGCGACCC GGATCATCAA CTTCGCTGTC GCGAACATGG GCCTCGTCGG CTCCGCGCTC TGCGCGCTGC TGGTCGTGCG CTACAACGTG CCCTACTGGA TCGGGCTGGC CGCCGCGCTC GCCGCCGGGG CGCTGTTCGG CGCCATCATC CATGTGGGCG TCATCCGCCG GCTGTTCACC GCCCCCCGCG TCATCGTCCT GGTCGCCACG ATCGGAGTCT CCCAGCTCGC GCTGACCATC GTGAACGCCT ATCCGGACCT GAAGGACCAC GCCGACCAGT CCTACCCGGT GCCCTGGTCC GGGACCTGGT CACCGGTGGA CGGCGTGCAG GTCACCGGCG CGCAACTCAG CATTCTCGTG GTCGTCCCCG TCGTGACCGC CGGGCTCTCC CTGTTCCTGA ACCGCACCGT CCTCGGCAAG ACGGTGAAGG CCTCCGCGGA CAACCCCGAG CTCGCCCGGC TGCAGGGCAT CAACCCGAAG ACCATTTCGC TGGCCGTCTG GACGGCCGCC GGCTTCCTCG GCACGCTGTC CATGATTCTG GTCGCCGCGC AGGAGAAGTC CCTGGCGCAG GTCACCACCC TCGGGCCGAC GACGCTGCTG CGGGCGTTGG CCGCCGCCGT GATCGCCCGG ATGGTCTCGG TCCGCATCGC GCTGGTCGCC GGCATCGGCC TCGGGCTGTT GCAGTCGTTC GTGCAGTTCA ACTGGCTCGA CCAGCCGGGC CTCACCGACA CCGTCATCCT CGTGATCGTC CTGGTTGCCG TGTTCTTCAC CAGCCGGGGC AGGAGCACCG AGGTCTCGAC GTTCTCGTTC GCGCCCAGGG CCCGCCCGGT CCCCGAGCGG CTGCGCGAGC TGTGGTGGGT CCGGCATCTC GAGCAGGCTC CACTGCTGCT TCTCGGGCTG TTCGCGCTGC TGCTGCCGGT CGTGGTCACC CAGCCCTCAC GCCACCTGCT CTACACGATC ATCCTCGGCT ACGCGATCTG CGCGGCTTCG GTCACCGTCC TCACCGGCTG GGCCGGCCAG CTCTCGCTGG GCCAGATGGC CTTCGCGGGC CTCGGCGCGC TCACCGCCGC CGCCCTGTTC CGCGGCCTCC GCCTGGACAT CGGCGACACG TCACTGGCGA TCAACGCCCT GCCGTTCCCG GCGGCGGTCC TGATCGCCAC GGTTCTCACC GCGGCCATCG CCGCGGTCAT CGGCCTGGGC GCGCTCCGGG TACACGGACT CCTGCTCGCG GTGAGCACCT TCGCGTTCGC CGTCGCCGCC GAGCAGTTCC TCTACCGGCG GGAGGTCTTC CACGACGAAG GCAGCAGCGC CGCGTCCTTC ACCCGCGGGA CCCTCTTCGG GATCGACATC GCCAGTCAGC GCACCTACTA CTACGTGGTG CTGGTGACCC TGGCCATCGT CATGGCCGTC GTGTCCCGGC TGCGCAAGTC CGGTATCGCC CGCACCACCA TCGGTGTCCG GGACAACCCC ACCACGGCCG CGGCCTACAC CGTCAGCGCC ACCGGCGTGA AACTGCGCGC CTTCGCTCTC GCGGGCGCGC TGGCAGGCCT CGGCGGCGCG CTGCTCGCCG GCGCGCTGCA GACAGTCCCC TACAACGACA TGTACTTCCG CAGCCCGGAC TCCCTCGTCC TCGTCTCCAT CGTGGTCATC GGCGGGCTCG GCTCCGTCTA CGGTCCGCTG CTCGGCTCGC TGTGGGTGAT CGGCCTGCCC TCCTTCATGC CCGACAACGA CATCGTGCCG CTGCTCTGTT CGAGCATGGG CCTCCTCGTC CTGCTGCTCT ACTTTCCGGG CGGCCTGGTC CAGGTGGGCT ACAGCACCCG CGACGCCATC CTGGGCTGGG CCGAGCGGCG CCGCGGGGAC GCGCCCACCA GCAAGACCGC CACCGCCCCG CCCGCCGCGC TGACCCGCAC CGACCGGGAG CCCCTGCCCG CCGGGCGGCC CGCGCTCACG ACGAGCGGAA TCCGGGTGCG TTTCGGCGGG CGGACCGCCG TCGACGGAGT CTCGATCGAG GTGATGCCCG ACGAGATCGT CGGTCTCATC GGGACCAACG GCGCCGGCAA GTCCACCTTC ATGAACGCCG TCGGCGGCTT CGTCCCCGCC GCCGGCGCCG TCACCATCCT GGGCCACGAC GTCTCCACAG CAAGCCCCGC GGCGCGGGCC AAGGTCGGCC TCGGCCGTAC CTTCCAGGCC GCGACGCTGT TCCCCGAGCT CACCGTCCGC CAGACCGTGC AGATCGCGCT GGAGGCTCGG GGTCGCACCG CGTTCCTGTC CACCGCCCTG CACCTGCCGC AGACCTTCGC CCGCGAGCGC GCCAAGCGGT CCGAGGCCGG CGACCTCATC GACTTCCTCG GCCTGGGCCG CTACGCCGAC GCCTTCGTCG CGGAACTCTC GACCGGAACC CGCCGCATCG TCGAACTCGC CTGCCTGCTC GCGCTCGACG CGAAGATGCT CTGCCTCGAC GAGCCCACGG CAGGCGTCGC CCAACGCGAG ACCGAGGCCT TCGGGCCGCT CATCCAGGAG ATCCGCCGCG AACTCGGCGC CGCGATGCTC ATCATCGAGC ACGACATGCC GCTGATCATG GGAATCAGCG ACCGTGTCTA CTGTCTCGAA GCCGGGAAGG TCATCGCTGC CGGGGTACCC GGCGCCGTCC GCAACGACCC CAGGGTCATC GCGAGCTACC TCGGCACCGA CGAACGCGCC ATCCAGCGCA GCGGCGCCAC CGTCGACATG CCCGGTCCGG CGGCCGTCGA CGACCGCGCC CCGACCGGCA ACCCGACAGC CGCGGTGTGA
|
Protein sequence | MQLPTAQLLF DGATTGLVIG LLAVGIVLVN RATRIINFAV ANMGLVGSAL CALLVVRYNV PYWIGLAAAL AAGALFGAII HVGVIRRLFT APRVIVLVAT IGVSQLALTI VNAYPDLKDH ADQSYPVPWS GTWSPVDGVQ VTGAQLSILV VVPVVTAGLS LFLNRTVLGK TVKASADNPE LARLQGINPK TISLAVWTAA GFLGTLSMIL VAAQEKSLAQ VTTLGPTTLL RALAAAVIAR MVSVRIALVA GIGLGLLQSF VQFNWLDQPG LTDTVILVIV LVAVFFTSRG RSTEVSTFSF APRARPVPER LRELWWVRHL EQAPLLLLGL FALLLPVVVT QPSRHLLYTI ILGYAICAAS VTVLTGWAGQ LSLGQMAFAG LGALTAAALF RGLRLDIGDT SLAINALPFP AAVLIATVLT AAIAAVIGLG ALRVHGLLLA VSTFAFAVAA EQFLYRREVF HDEGSSAASF TRGTLFGIDI ASQRTYYYVV LVTLAIVMAV VSRLRKSGIA RTTIGVRDNP TTAAAYTVSA TGVKLRAFAL AGALAGLGGA LLAGALQTVP YNDMYFRSPD SLVLVSIVVI GGLGSVYGPL LGSLWVIGLP SFMPDNDIVP LLCSSMGLLV LLLYFPGGLV QVGYSTRDAI LGWAERRRGD APTSKTATAP PAALTRTDRE PLPAGRPALT TSGIRVRFGG RTAVDGVSIE VMPDEIVGLI GTNGAGKSTF MNAVGGFVPA AGAVTILGHD VSTASPAARA KVGLGRTFQA ATLFPELTVR QTVQIALEAR GRTAFLSTAL HLPQTFARER AKRSEAGDLI DFLGLGRYAD AFVAELSTGT RRIVELACLL ALDAKMLCLD EPTAGVAQRE TEAFGPLIQE IRRELGAAML IIEHDMPLIM GISDRVYCLE AGKVIAAGVP GAVRNDPRVI ASYLGTDERA IQRSGATVDM PGPAAVDDRA PTGNPTAAV
|
| |