Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3676 |
Symbol | |
ID | 5672042 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4351562 |
End bp | 4352893 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641242559 |
Product | ABC-type branched-chain amino acid transport systems periplasmic component-like protein |
Protein accession | YP_001507979 |
Protein GI | 158315471 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.36982 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGTC ATGGCCGCCG CCTGCGCTGC CTGGCCATCC TGCTCGCCGT CGGCTGTCTG GTCGCCGCCT GCTCGAACTC CTCCGACTCG AACACCGCCG CCCCCGCCTC GTCCGCGGGT GGCGGCGCGG TACCGGGTGT CACGAACGCC GAGATCCGGT TCTCCGTCCT GGGGACGCGA ACCAACAACC CGCTGGGCAC CTGCCTGCTG GACTGCTTCT CCCAGGGGGT CAAGGCGTAT TTCGACTACC GGAACAGCGA GGGTGGCGTC CACGGCCGCA AGCTCGTCGT CGCCCAGGAG CTCGATGACG CGCTCGGGCA GAACCAGCAG AAGGCGATCG AGATCGTGTC GGCGAAGGAC ACCTTCGCGA CCTTCAGCGC CCCGCAGCTC GCCAGCGGAT GGCAGAACTT CGCGGACGCG GGGATCCCGT TGTATGGATG GGACATCCAC CCGGCGCAGA TGATCGGCCG GAAGAGCATC TTCGGTAACG CGGCCCCGCC CTGCCTGGAC TGCATGGACC GCACTTTCAC CTACGTCGCC GAACTGGCCG GCACCAAGCG GATCGCGGCG CTCGGGTACG GCGTGTCCGA CAACTCGAAG CAGTGCGTCA CCCAGATCAC CGACACGATC GAGCGGTACG GTGCGGCCAC CGACCAGGAG ATCGTCTACA AGAATGACAC CCTGGCGTTC GGCCTGAGTA ACGGGGTCGG TCCGGAGGTG GCCGCGATGA AGCGCGCGGA CGCGCAGCTG GTCATCACCT GTCTCGATCT GAACGGCATG AAGACCCTGG CCCAGGAACT CGAGCGGCAG GGCATGGGCG ACGTCCCGAT GTACCACCTG AACACCTACA ACCAGGAGTT CGTCGAGGAG GCCGGCGCGC TTTTCGAGGG TGACTACGTC AGTGTCGGGT TCCGGCCGTT CGAGGCCGAT CCCGCGGACA CCGCCATGGC CACGTTCAAG AAGTGGATCG GCAAGGTGGG CGGCCGGCCC GACGAGATGG CGATGTACGG ATGGATCAAC GCGGACCTCG CCTACCAGGG CATCCTCAAG GCGGGGCCGG CGTTCACCCA GGCGTCCGTG ATCGACGCCA CCAACCACCT GACCGACTAC ACCGCGGATG GCCTGACCGT TCCGGTGGAC TGGTCGCGTC AGCACAACCA GCGTTCCAAG GCCGACCCGG TCACCAACGG CTACAAGCTC GACTGCCGGG CGCTCGTCCG CGTGCGGGGC GGGAAGTTCG AGATCGTCGG TGGAACGAAG GACAAGCCGT TCGTCTGCTG GCCGCCGAAG GACACCACCT GGTCCGAGCC GAAGCCGACG AGCTTCGGCT GA
|
Protein sequence | MKRHGRRLRC LAILLAVGCL VAACSNSSDS NTAAPASSAG GGAVPGVTNA EIRFSVLGTR TNNPLGTCLL DCFSQGVKAY FDYRNSEGGV HGRKLVVAQE LDDALGQNQQ KAIEIVSAKD TFATFSAPQL ASGWQNFADA GIPLYGWDIH PAQMIGRKSI FGNAAPPCLD CMDRTFTYVA ELAGTKRIAA LGYGVSDNSK QCVTQITDTI ERYGAATDQE IVYKNDTLAF GLSNGVGPEV AAMKRADAQL VITCLDLNGM KTLAQELERQ GMGDVPMYHL NTYNQEFVEE AGALFEGDYV SVGFRPFEAD PADTAMATFK KWIGKVGGRP DEMAMYGWIN ADLAYQGILK AGPAFTQASV IDATNHLTDY TADGLTVPVD WSRQHNQRSK ADPVTNGYKL DCRALVRVRG GKFEIVGGTK DKPFVCWPPK DTTWSEPKPT SFG
|
| |