Gene Information |
Locus tag | Franean1_3395 |
Symbol | |
ID | 5671766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4022954 |
End bp | 4025824 |
Gene Length | 2871 bp |
Protein Length | 956 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641242283 |
Product | ABC transporter related |
Protein accession | YP_001507703 |
Protein GI | 158315195 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0411] ABC-type branched-chain amino acid transport systems, ATPase component |
TIGRFAM ID | |
|
|
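The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 4022954
END_BP = 4025824


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp")  # 2871 bp
print(length_aa, "aa")  # 956 aa
```

Both results agree with the Gene Length (2871 bp) and Protein Length (956 aa) rows.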
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGATT TCTTGCCATT CATCGTCATC GGTCTTGCGA CCGGCGCGGT CTACGGCCTC GCTGGAATCG GCCTGGTGCT CACCTATAAA ACGTCGGGTA TATTCAACTT CGGCTACGGT GCGGTAGCCA CGCTCGTCGC GTTCTGTTTC TATTTCCTGA ACGTCGATCA CGGCTGGCCG TGGCCGCTCG CGGCGGCCGT CTCGCTTCTC GTGTTCGCGC CGTTGCTCGG CCTGGTGCTG GAGCTGTTGG CCCGGTCGCT CAACGGAGCC AGCGAAACGA TCAAAGTCGT CGCGACCGTC GGACTGATCC TCGTCGTGTC GAGCATCGGG CTGCTGTGGC ACCCGGTCAA TCCGCCGACC TTCCCCCATT TCCTGCCGCA GGACACGGTG CGGATGGTCG GGGTGAACGT CACCTGGGAG GAGATCATTC TCTTCCTGCT GTCGGCGGCG GCCGCTGCCG GGCTCTACTG GTTCTTCCGG TCCGTCCGGT TCGGCATCGT CATGCGCGGT GTCGTCGACA ACCATGAGCT CATCTCCATG AGCGGCGACG ACCCGGTACT TGTCCGCCGG GCGGCCTGGG TCATCGGCAG CGTCTTCGCC GGCGTGGCCG GTCTGCTGCT CGCACCGTCG CACGACCTCG ACGGCGTCAC CCTGACGACG ATCGTGTTCG CGGCCTTCGG GGCCGCCGCC ATCGGCTACT TCACCAACCT GCCGCTGACG TTCGTCGGCG GCCTGGTGGT GGGTATCGCC AGTTCGCTGG TCGACAAGTA CTCCGCGACC ATCACGTGGA TCGGCGGGCT GCCGCCGGCG CTGCCGTTCG TCATCCTGTT CGTGGCCCTC ATCGTGCTGC CCCGGCGCCT GCTCGCGCAG CGGCGGCTGA CGGCCGTGCT GAGCACCCGG CGCTCCTACC ACGCTCCGGT CCGGATCCGG CTGACGACGT TCGCGATCGC GATCGTCCTG CTCGGCCTGG TACCGACGAT CCAGTCCGGC CACATCGCGG TGTGGTCGTC CGCGCTCATC AACATCATGC TGTTCCTGTC GCTGGGCCTC CTTGTCCGGC GGTCCGGCCA GATCTCGCTG TGCCACCTGG CGTTCGCCGC GGTCGGCGCG GCCGCCTTCG GGCACTTCTC CAACAGCATG CCGTGGCTGC CGGCGCTCAT CCTGGCGACG CTGGTCGCCA TCCCGGTCGG TGCCCTGATC TCGATTCCGG CCGTGCGGGT GTCCGGTGTG TTCCTCGCGC TGGCCACGCT GGGGCTGGGC ATTCTCGCCG AGCAGGTCTT CTACACCCGG AACTTCATGT TCAGCCAGTC CGCGCTGGGC ATCGAGGCAC CGCGGCCGGT CTTCTCCATC GGCAGTCTGG ACCTCTCGAG CGACAAGGGC TTCTACTACC TGCTGCTGGT CATCACCGTG CTCGTCGCCG GCGTCATCAC CGCCATCGGC CACGGCCGGC TGGGCCGGCT GCTCGAGGCG CTGGCGGACT CGCCGCTCGC GCTGGAGACC CACGGGACGA CCTCGAGCGT CCTCAAGGTG ATCGTCTTCT GCGTCACCGC CGCGATCGCG TCCCTGGCCG GGGCGCTCAA CGCGATGCTG TTCCACTTCG GCGTCGGCAC CTACTACCCG TCGTTCAGCT CGCTGACCCT CGTCGCGCTG GTCGTGATCG TCACCATCGG CGATCCGTGG TACGCACTCG TCGCGGCCGT CGGCTACAGC GTGCTCCCGG CCTACATCAC CGGCCAGAAC ACCAGCACCG TCCTCAACCT GCTCTTCGGG CTCGGCGCCG CCACGGCCGC GTGGGGCACC 
CGGGGCGGCG TCACACCCGC CCGCCTGCAG GCGCTGCTGG ACCGACTGGG CGGCAGGGCG GCCCCTGTCA CCTTGGACGG CCTGGCCGCC GACGGCCTGG CCGCCGACGG CCTGGCCGCC GACGGCCTGG CCGCCGACGG CCTGGCCGCC GACGGCCTGG CCGCCGTCGA CACAGCCGCC GTCGACGCAC CGGCAACGCC CGGCGTGCCG GCCCGCCAGG TGCCCTCGCC CCGCCGCGAG GCTCCCACCG GGGACGGCCT GACAGTCCGC GACCTGTCCG TCCGCTTCGG CGGCGTGCAC GCCGTGAACG GGGTGACCCT CAAGGCCAGG CCCGGGGCCA TCACCGGTCT CATCGGGCCG AACGGCGCCG GCAAGACCAC GACGTTCAAC GCCTGCAGCG GCCTGCTCCG GCCAAGCTCC GGCGAGGTCC TCCTGCACGG CGCCAACGTC ACGGGGGAGG GGCCCGCCAG CCGGGCCCGG CACGGGCTGG GACGGACGTT CCAGCGCACC GAGCTGTTCA ACAGCCTCAC CGTGCGGCAG AACGTCGCCA TGGGCCGCGA GGCGTCCATG GCCGGCGCGA ACCCGCTCAA CCACCTGGTG AGCTCGCGGC ACGCGAACCG TGTGGTCTCC GAGGTCGTCG AGGAGTCGCT CGCGCTCACC GGAACCACCC GGATCGCAGA CCTGCAGGTC GGGCTGCTCC CGATCGGGCA GCGGCGCCTG GTGGAGCTGG CCCGCGCCCT CGCCGGTCCG TTCGACATGC TGCTGCTGGA CGAGCCCTCC TCCGGGCTGG ACGGCCACGA GACCGAGCAG TTCGGCCAGG TTCTCCAGAC CGTGGTTCGC GAGCGCGGCT GCGGTGTCCT GCTCGTCGAG CACGACATGA GCCTGGTCCG GGAGATCTGC GACTACCTGT ACGTGCTCGA CTTCGGGCAA CCGATCTTCG AAGGGACCCC AGACCAGATG GAGAGCTCAG ACCAGGTCCG CAGCGCCTAC CTCGGCAGCG TGGCCGTTGC CGCGGACAGT GCGGACGACC CATCCACCGA CCGGAGCATG CCGCTCCAGC CCCAGGAGTA G
|
Protein sequence | MNDFLPFIVI GLATGAVYGL AGIGLVLTYK TSGIFNFGYG AVATLVAFCF YFLNVDHGWP WPLAAAVSLL VFAPLLGLVL ELLARSLNGA SETIKVVATV GLILVVSSIG LLWHPVNPPT FPHFLPQDTV RMVGVNVTWE EIILFLLSAA AAAGLYWFFR SVRFGIVMRG VVDNHELISM SGDDPVLVRR AAWVIGSVFA GVAGLLLAPS HDLDGVTLTT IVFAAFGAAA IGYFTNLPLT FVGGLVVGIA SSLVDKYSAT ITWIGGLPPA LPFVILFVAL IVLPRRLLAQ RRLTAVLSTR RSYHAPVRIR LTTFAIAIVL LGLVPTIQSG HIAVWSSALI NIMLFLSLGL LVRRSGQISL CHLAFAAVGA AAFGHFSNSM PWLPALILAT LVAIPVGALI SIPAVRVSGV FLALATLGLG ILAEQVFYTR NFMFSQSALG IEAPRPVFSI GSLDLSSDKG FYYLLLVITV LVAGVITAIG HGRLGRLLEA LADSPLALET HGTTSSVLKV IVFCVTAAIA SLAGALNAML FHFGVGTYYP SFSSLTLVAL VVIVTIGDPW YALVAAVGYS VLPAYITGQN TSTVLNLLFG LGAATAAWGT RGGVTPARLQ ALLDRLGGRA APVTLDGLAA DGLAADGLAA DGLAADGLAA DGLAAVDTAA VDAPATPGVP ARQVPSPRRE APTGDGLTVR DLSVRFGGVH AVNGVTLKAR PGAITGLIGP NGAGKTTTFN ACSGLLRPSS GEVLLHGANV TGEGPASRAR HGLGRTFQRT ELFNSLTVRQ NVAMGREASM AGANPLNHLV SSRHANRVVS EVVEESLALT GTTRIADLQV GLLPIGQRRL VELARALAGP FDMLLLDEPS SGLDGHETEQ FGQVLQTVVR ERGCGVLLVE HDMSLVREIC DYLYVLDFGQ PIFEGTPDQM ESSDQVRSAY LGSVAVAADS ADDPSTDRSM PLQPQE
|
| |
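The protein sequence follows from the gene sequence under translation table 11, where the initial GTG is read as methionine. The sketch below is a minimal illustration in plain Python rather than any particular bioinformatics library. The helper `translate` and the constants are hypothetical names introduced here. It assumes that table 11 assigns the same amino acids as the standard code and differs only in its set of permitted start codons, listed in `START_CODONS`, and that the gene lies on the + strand, as stated above, so no reverse complement is needed.

```python
from itertools import product

# Codons in NCBI order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed start codons for translation table 11 (bacterial).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a + strand CDS, ignoring the spaces between 10-base blocks."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk whole codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as Met
            continue
        aa = CODON_TABLE.get(codon, "X")  # X marks an unrecognised codon
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("GTGAACGATT TCTTGCCATT CATCGTCATC"))  # MNDFLPFIVI
# Last 21 bases of the gene sequence, ending in the TAG stop codon.
print(translate("CCGCTCCAGC CCCAGGAGTA G"))  # PLQPQE
```

The two fragments reproduce the first ten and last six residues of the protein sequence listed above. Passing the full gene sequence to the same function should yield the complete 956-residue protein.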