Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4474 |
Symbol | |
ID | 5675736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5338138 |
End bp | 5339391 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641243341 |
Product | hypothetical protein |
Protein accession | YP_001508757 |
Protein GI | 158316249 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCATTTT GTCGTCCACG GCGGCTGAAC GCTGCCGTCA GTGTGATCGC GATCAGCGGC CTGACGAGTG TGGTGGCGGC CTGCGGGATC ATGGGCGGGG GAGACGGGTC CGGGCCAGCG ACCGAGGGCT GTGCGACACC CGGTGTGACC TCGACCCAGG TCACTCTCGG AACGGCCATT GACGACACGG GGATCGGCGC GGGTGCCCTC GCGGCGTTCC GCGCCGGGAT CGATGCCCGT CTCGGCGTGG CGAACGACAA CGGCGGGGTC AACGGCCGGA AGGTCGTCTA TGAGTGGAAG GACACCCAGT CCGACCCGTC GTCCACCCAG AACGTCGTGC GCGAACTGGT CGAGACCAAG GGCGTGTTCG GCATCATCCA GGGCTCGATC ATGGCCCTCA GCTCGGCGGA CTACGTCGAG GAGCACGGCA TCCCGGTGGT CGGGCCCAGC ATGGACGAGT CCTGGCCAGA CCACCCGAAC ATGATCAGCT GGTTCTACGT CCAGTCCGCG AGATGGTCCG TAAGCACCTG GGGTGACTTC GCGCGGTCCC AGGGGGCTAC CCGCGTGGCC ATCCTCGGCT TGGCCCTCAA TCCGGGGACC TACGAGGCTC AGCTACAGGC GAGTATGGAA TCGGCCGGGA TCCCGGTCGT CCTCAACCCG GACGTGACGG TCGGTGCCAC CAGTTTCAGC CGACTGGCGC AGGAGTTGAA GGCCGCGAAT GTCGACACGA TCACTGGTGC GGTGACCCCC GACGTCCTGG CCCAGCTCAT GCCGGCCGTG CGCAACGCTG GCCTCCAGCT GAAGCTGGTG ATGACACCCA CGGGCTACGA CCCTGCCCTG CTCCAGCGAC TCGGCCCGCA GGTCGCCGGA ACGACGATCT ACGTTGATTT CGCGCCGTTC GAACTGAACC TGCCGGCCCA CCAGACATTC ATCTCCGCCA TGGCCCGGTA CGCACCCGAG ACCCAGCCGC CGCAACAGCA GAGTGCCGTG TGGGGATGGA TCTCGGCCGA CCTGTACCTG CGCGGGCTGC AGGACGCCGG CGAATGCCCG ACCCGCGAGG GGTTCGTGAA CGCGTTGCGC GCCGTCCACG ATTACGAGGC GGGCGGACTC CTGTCCAGCA AGGTCGACTT CGCGACCAAC ATCGGGCAGC TCAGCACCTG CTACCAGTTC GTGCAGATAT CCCAGGACGG AAAGGAGTTC ATCCCGCTCA ACCCCAGCCA GCGCTGCGGT ACCATACTGA GCCGTTTCCG GTAA
|
Protein sequence | MPFCRPRRLN AAVSVIAISG LTSVVAACGI MGGGDGSGPA TEGCATPGVT STQVTLGTAI DDTGIGAGAL AAFRAGIDAR LGVANDNGGV NGRKVVYEWK DTQSDPSSTQ NVVRELVETK GVFGIIQGSI MALSSADYVE EHGIPVVGPS MDESWPDHPN MISWFYVQSA RWSVSTWGDF ARSQGATRVA ILGLALNPGT YEAQLQASME SAGIPVVLNP DVTVGATSFS RLAQELKAAN VDTITGAVTP DVLAQLMPAV RNAGLQLKLV MTPTGYDPAL LQRLGPQVAG TTIYVDFAPF ELNLPAHQTF ISAMARYAPE TQPPQQQSAV WGWISADLYL RGLQDAGECP TREGFVNALR AVHDYEAGGL LSSKVDFATN IGQLSTCYQF VQISQDGKEF IPLNPSQRCG TILSRFR
|
| |