Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2025 |
Symbol | |
ID | 5670426 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2433825 |
End bp | 2435093 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641240946 |
Product | hypothetical protein |
Protein accession | YP_001506368 |
Protein GI | 158313860 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0203629 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.690247 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTGGTAT TTCGTACGAG GTTGCTCGCC GCGGCGCTCG TGACGGCCGC CCTGGCGGCG GCCTGCGGCT CCAGCGAGTC CTCCGGCGTG GACGCGGCGG CGTCGCCCTG TGCTCCCGGA GTCACGGACG ACGAGGTCAA CGTCGGTATT GTCTACCCGG ACACCGGTGT GCTCTCGGCA CAGTTCACCG GCTACCGGTT CGGCGTCGAG GCCCGGTTCG CCGAGGCGAA CGCCGCCGGT GGCGTCGACG GCCGGCAGAT CTCGACGATC TGGCGGGACG ACGAGTTCGA CTCCGCCGGC AACCTGCAGG CGGCCCGCGA GCTTCTCCGC GAGAACGTGT TCGCGGTTCT CGAGTACACC GCGCATTCCG AACAGTCCAC GCCGCTGTTG CACGACAAGG GAATTCCGGT GGTGGGCGTC GCGGACCAGG CCGGGTGGGC CGACAACGAC AACATGTTCC CGCTCACGTA TCAGGTCGAC GACACCGACG CGACCAGCAC TCTCGGAGAT TTCGTCCGGG CCCAGGGCGG CACCCGCGCG GCCCTCATCA CGACGACGAT CACCGAGTCG TCGGTCATGT ACGCCCAGAA CGCCCGCCGG AGCCTGGAGG CTGCCGGCAT CCCGGTCGTC TTCGCGGACA CGAACGCCAC GGTGAGCTCA CCGGAGGCCG TGCAGCGGAT CGTCGGCAGC GGCGCCGACA CGCTGATCTC GCTCGCCTCG CTCGACCTCT ACTCCGGCGC GGTGACGGCG GCGGCCGCGG CGAACAGGCC GTTCAAGGTG GCGGTCTCGC CCATCACCTA CGACGCGCAC CTGCTCAACA CGCCGATCGC GCGCGCTCTG GCCGGCACCT ACTCGACCGT GGGCTTCAGC GCGATCGAGC GGAACCTGCC CGCGCACCGC GCCTACCTGG CCGCGATGAC CACCTACGCG CCGGAGATCC AGCCGCCGAC GCAGCTGAGC TCCATCTACG GCTACATCAC CGCGGACCTG TTCCTGCGGG GCCTGCAGGG GCAGCAGGGC TGCCCCACCC GCGAGTCGTA CATCCGCGGG CTGCGCGGTG TGACCGACTA CGACGGGGGC GGACTGCTCA ACCAGCCGGT TGACCTGTCC GCCGGCAAGC GCACGGTCGA CCTGTGCACG GACTTCGTCC GGGTCTCCGC CGCCGGGAAC GCCTTCGAGC CCGTCGGCGC GAAGCCGCTG TGCGGCCAGG TGATCAGCGG CGCGGCCCAG GGAGGCGCCG GGGCGTCCGT GCCAGCCCGG GGGCGGTGA
|
Protein sequence | MLVFRTRLLA AALVTAALAA ACGSSESSGV DAAASPCAPG VTDDEVNVGI VYPDTGVLSA QFTGYRFGVE ARFAEANAAG GVDGRQISTI WRDDEFDSAG NLQAARELLR ENVFAVLEYT AHSEQSTPLL HDKGIPVVGV ADQAGWADND NMFPLTYQVD DTDATSTLGD FVRAQGGTRA ALITTTITES SVMYAQNARR SLEAAGIPVV FADTNATVSS PEAVQRIVGS GADTLISLAS LDLYSGAVTA AAAANRPFKV AVSPITYDAH LLNTPIARAL AGTYSTVGFS AIERNLPAHR AYLAAMTTYA PEIQPPTQLS SIYGYITADL FLRGLQGQQG CPTRESYIRG LRGVTDYDGG GLLNQPVDLS AGKRTVDLCT DFVRVSAAGN AFEPVGAKPL CGQVISGAAQ GGAGASVPAR GR
|
| |