Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2954 |
Symbol | |
ID | 5671340 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3477569 |
End bp | 3478816 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641241860 |
Product | hypothetical protein |
Protein accession | YP_001507280 |
Protein GI | 158314772 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.189818 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAGAT TATGTGTCGT CGTCGTGGTG GCGGCGCTCG CCGCGGTAAT ACTGGCTGCC TGCGGCTCCG GCTCCGGCTC CGGCTCCAGC GCCGGCGGCG GCATTGGCAC AGCAAAAGGG GACCCGATCG TCGTCGGGAC GATTTGCAGC TGCTCGGGAG CCCAGGCGGC CTCGACCGCC AAGGTCAAGG ATGTCGCCCA GGTCTGGGCC GAGTCGGTCA ACGCCCGCGG TGGCATCAAC GGCCATCCCG TAAAAATGAT TGTCGAGGAC GATGCCGGCG ACCCGGCCAA GGCTCTGCGG GCCGCGAAGA AACTGGTCGA GCAGGACCAC GTCGTTGCCA TCGTCGGCCA GATGAGCCTC GCCGACGCCA CATGGGCCGA CTATACGTCG AAGAAGGGTG TGCCGGTGGT CGGCGGGATC TCCGCCACGA TCCCGTACCT TACGGAGCCG ACCTTCTTCG CCTCGGGTAC CACATTGCCG GCGCTTGTGG CAGGCACCCT GCTGACGGGA AAGGAAGCAG GCAAGACCAA GCTCGGGGTA TTGTACTGCT CCGAGTCGCC CTCGTGCTCG CAACTTGACC TGTTCACCAA GGCAGCTGCT GGGATTGTGG GAGGCATCAT TGTCACGTCG GCGAAGATAT CCTCGACCGC GCCGAACTAC ACAGCTCCTT GCCTGGGACT GAAGCGGGCC GGTGTGGACA GCATAACCCT CTCGGCGAAC AACACGGTCG TAGCGCGCGT CATGGAGGCC TGCGCGCAGC AGGGACTCCA TCCCACCATG GTCAGCCAAT TCGCCACGAG TGCACCGGAG TGGCTTCGTA ATCCCAACTT CGAGGGCGCT TTGCTGACAT CGCAGACCGC CGTGCCGGTC GCGTCCGCAC CAGCCGGCAA AGAGTTCCTG GACGCAGTGG CAAAATTCGC CCCTGGGCTC ACCCGTGACC CTCAGTTCAC CCCGGCTACG ATCCTGGCAT GGGCCGGCGG AAAGCTGTTC GAGGCGGCGG CCAAGGCAGG TGGTGTCGGA CCGAACTCGA CGCCCACTGA TGTCATGAAG GGCCTCTACA CCCTCAAGAA CGAGACGCTC GGTGGCCTCG CTCCGCCACT AACCTTCACG AAGGGCAAGC CCGCTGTCCC GACCTGCTAC TTCACCACCC AGATCCATAA CGGCGAGTAC ATACTGCCGG ATGGGCCCGC GCCTAAGTGC CTAACCCCGT CCCAGGTCGA GGCGTTCGGG AAATTGCTCG CGGGATAA
|
Protein sequence | MKRLCVVVVV AALAAVILAA CGSGSGSGSS AGGGIGTAKG DPIVVGTICS CSGAQAASTA KVKDVAQVWA ESVNARGGIN GHPVKMIVED DAGDPAKALR AAKKLVEQDH VVAIVGQMSL ADATWADYTS KKGVPVVGGI SATIPYLTEP TFFASGTTLP ALVAGTLLTG KEAGKTKLGV LYCSESPSCS QLDLFTKAAA GIVGGIIVTS AKISSTAPNY TAPCLGLKRA GVDSITLSAN NTVVARVMEA CAQQGLHPTM VSQFATSAPE WLRNPNFEGA LLTSQTAVPV ASAPAGKEFL DAVAKFAPGL TRDPQFTPAT ILAWAGGKLF EAAAKAGGVG PNSTPTDVMK GLYTLKNETL GGLAPPLTFT KGKPAVPTCY FTTQIHNGEY ILPDGPAPKC LTPSQVEAFG KLLAG
|
| |