Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4737 |
Symbol | |
ID | 5673079 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5656225 |
End bp | 5659047 |
Gene Length | 2823 bp |
Protein Length | 940 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641243594 |
Product | ABC transporter related |
Protein accession | YP_001509010 |
Protein GI | 158316502 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0410] ABC-type branched-chain amino acid transport systems, ATPase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGGAGT TCCTCAACCT CATCGTCGGC GGCGCCGTCG CCGGCGGTCT CTACGCCATC CTGGCGGCCG GCGTGGTCTT GACCTATCAG ACGTCGGGCA TCTTCAACTT CGCGCACGGT GCGGTCGCCT TCGCCACCGC CTATCTGTTC GTGCAACTCA ATGTCACCGC CGGCGTGCCG GTCGTCCCCG CGGCGATCAT TTCCATTCTT GTCTTCGCGC CGCTGCTGGG CCTGTTACTG GACAGGCTGG TCTACCGGCG GCTCGCACAC GCCCCGTTGG CCGTGAAGAT CGTGGTACCG ATCGGGCTGC TGATCGCGGT GCCCGGGCTC TGCCTGTTCA TCTCCAGCCG GCTGAACCTG TGGTTCCACC TGGACATCGC CGGCATCGAG GATCTGTTCC TGATCCCCGG TCTGGGGCCG ACCCCGAAAA AGACGTGGAC GCTCGGCGGG GTGCTGCTGG ACAGCAACCA GATCGCCGTC TTCGGCGCCG CTGTGCTCAC GGCCGTCGGT CTGTGGGTCC TGCTGCAGCG CACACGGCTC GGGCTCCAGA TGCGGGCCGT GGTGGACCGG CGGAGCCTGG CCGCCCTGCG CGGCGTCGAT CCCGACCGGA CCTCCGCCGT GTCGTGGATG CTCGGCAGCT TCCTCGCCGG GCTCGCCGGC GTGCTGGTCG CCCCGATCTT CACGCTCAAC ACCCCGGTGT TCACCACGGT CGTGCTGATC TCGACGCCGG CCGTCGTGTT CGCCCGGTTC CGCTCGATGC CCCTCGCGCT CGCCGGCGGC CTGCTGATCG GCGTCCTGCA GAACCTCATC GTCGGCTACG CGGACTTCGC CCAGAACATC TCCGGGTTCA GCACGTCGGT GCCGTTCATC CTGCTACTCG GGCTGCTGTT CGTGTTCGCC GTGGACCGCA GCCGGCGGGC CGGCGCGCAC GCCGACGACG ACCCGCCCCC GCCGTCCTAC GGGACGGAGT CGAAACGCCG CAAGGTCGTC ACGTGGACGG TGTGGAGCGT CGTCGTCCTC GTCTTCGCGC TGTTCGTGGC GGACGAGTAC TGGCAGTCAC TCATCATCCG CGGGCTGGCG CTGTCGCTGG TGCTGCTCTC GTTCACGGTG GTGACCGGAG TCGGCGGGAT GGTCAGCCTC GCGCAGGCTG CGTTCGTGAC CGCCGCCGGC ATCACGGCCG GCTGGGCGGT CAGCCACCAC TGGCCGTTCG CGGTCGCGCT GCTCCTCGGC ACCGCGGTCG CCACCGCGAT GGGCGTGCTG GTGTCGCTGC CCGCCCGGCG GCTCGGCGGA CTGCCCCTGG CGCTGGCGAC CCTGGCCCTG GCGTACCTCT GCCAGAACCT GTTCTTCCAG CTCAGCGGCG TGAGCCGGAA CGACTTCGGT GGGTGGGTGC TCAGCCCGCC CGCCCTCGGG CCGGTGGACC TCGCCGACCC CCGCTCAATG ATCATTTTCC TGTCGGTCGT GCTGGGTCTC GCCCTTCTCC TGGTCAGCAA CCTGATCCGG TCGAGCTCCG GCCGGGCGAT GGCCGCGCTG CGCTCCACCG AGCCGGGCGC GGTGACCATC GGCATCTCGG CGGGCCGCAC GAAGACCGCG GTGTTCGCCC TGTCCGCGGC CATCGCCGGG TTCGGCGGTG TGCTGCTCGC GAGTTCGTCC GGGCGCATCA CCCACCTCGA CTATCCGGTC GAGACGGGCC TGTTCTGGCT CGCCACGGTC GTGCTGTTCG GGGTGCGCCG GCCGGCCGCG GCCGTGATCG CCGGGCTGAG CGCCGCCGTC AGCCCCGAGA TCCTCAGCCA CGTCGCGGAG ACGTCCTACC TGCCGCAGGT GCTCTCCGGC CTGGCCGCGA TCAACCTGGC GCAGAACTCC GACGGCATCC TGGCGCTCAC CGCGCAGCAG CGGTTCGAGC GTCGCCGCAG GCGGCAGCAG CGGGCGCTGC GCCGGGAGGC CGCCGCCACG GCGGCGACCC CGGTCAGCGC GACCCCGAAC AGCGCGGCTG CGGTCACCGC GCCGGGCACC GAGGCCTCGG AGGGTGCCTC GACCGTGCTC GAGCTGCGTG ACGTGCACGC CGCCTACGGC GCCGTGGAGG TGCTGCACGG GGTCAGCCTG ACGCTGCGGG CGGGCGAGGT GCTCGCCCTC ATCGGGGCCA ACGGGGCCGG CAAGTCGACG ATCTGCGGGG TGGTCACCGG CGGCGTCCCG GTGACGCACG GCACCGTGCG GCTCGACGGC CAGGACGTCA CGGGGCTGCC ACCGCACCGC AGAGTCCGGC ACGGCGCCTT CCTCATCCCC GAGGGCCGCG GCATCTTCCC CGCTCTGACC GTGGACGAGA ACCTCTCCCT GTGGCTCCCG TCCGCCGATG ACCGGGACTC CGCCTACCGC CGCTTCGCGG TGCTCGGCCG GCGCCGCGGC CAGCTCGCCG GATCCCTGTC CGGCGGCGAG CAGCAGATGC TGGCCCTGGC GCCGGCGCTG GTGAGGCCCC CCGCCGTCCT GATCGTGGAC GAGCCCTCGC TCGGCCTGGC CCCCCTCGTC GTCGCCGAGG TCTACGCCGC CCTGGAGGAA CTGCGGGCCA CCGGGACCGC GATCCTGCTG GTCGAGGAGA AGGCCCACGA CGTGGTCGCG CTCGCCGACA CCATCGCCTT CATGGCCGTC GGCCGGGTGG CCTGGGCGCA GCGGACCGCG GACGTCGACG CGGACCTGCT GGTGCAGTCC TACCTCGGGA TCAGCGACAC GCCGGCCGGG CCCTCCGGGT CACCGGCAGC CCTGGCCGGC AGCACGACCG GCCCCAGCAC GACCGGCCCC GCCGTCCCGC AACCCGCGAA GGAGCGTCCA TGA
|
Protein sequence | MEEFLNLIVG GAVAGGLYAI LAAGVVLTYQ TSGIFNFAHG AVAFATAYLF VQLNVTAGVP VVPAAIISIL VFAPLLGLLL DRLVYRRLAH APLAVKIVVP IGLLIAVPGL CLFISSRLNL WFHLDIAGIE DLFLIPGLGP TPKKTWTLGG VLLDSNQIAV FGAAVLTAVG LWVLLQRTRL GLQMRAVVDR RSLAALRGVD PDRTSAVSWM LGSFLAGLAG VLVAPIFTLN TPVFTTVVLI STPAVVFARF RSMPLALAGG LLIGVLQNLI VGYADFAQNI SGFSTSVPFI LLLGLLFVFA VDRSRRAGAH ADDDPPPPSY GTESKRRKVV TWTVWSVVVL VFALFVADEY WQSLIIRGLA LSLVLLSFTV VTGVGGMVSL AQAAFVTAAG ITAGWAVSHH WPFAVALLLG TAVATAMGVL VSLPARRLGG LPLALATLAL AYLCQNLFFQ LSGVSRNDFG GWVLSPPALG PVDLADPRSM IIFLSVVLGL ALLLVSNLIR SSSGRAMAAL RSTEPGAVTI GISAGRTKTA VFALSAAIAG FGGVLLASSS GRITHLDYPV ETGLFWLATV VLFGVRRPAA AVIAGLSAAV SPEILSHVAE TSYLPQVLSG LAAINLAQNS DGILALTAQQ RFERRRRRQQ RALRREAAAT AATPVSATPN SAAAVTAPGT EASEGASTVL ELRDVHAAYG AVEVLHGVSL TLRAGEVLAL IGANGAGKST ICGVVTGGVP VTHGTVRLDG QDVTGLPPHR RVRHGAFLIP EGRGIFPALT VDENLSLWLP SADDRDSAYR RFAVLGRRRG QLAGSLSGGE QQMLALAPAL VRPPAVLIVD EPSLGLAPLV VAEVYAALEE LRATGTAILL VEEKAHDVVA LADTIAFMAV GRVAWAQRTA DVDADLLVQS YLGISDTPAG PSGSPAALAG STTGPSTTGP AVPQPAKERP
|
| |