Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2282 |
Symbol | |
ID | 5670681 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2726183 |
End bp | 2727367 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641241202 |
Product | hypothetical protein |
Protein accession | YP_001506623 |
Protein GI | 158314115 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.212652 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACGAA CAAAACTGGG CGTGGTGGCC CTGGGCGCGA CGGCGGCACT GGCACTGGCG GCCTGCTCCG GTGACTCGAC GACGGGCGAG AGTAGTTCCG GCGGTGAGCG GGCGCCGGCG GTCGAGGCGG GTGGCGCCCT CGACCTCGCC GGGGTCTGCC CGGAGAACGT GGTGATCCAG ACGGACTGGT TCGCCGAGTC CGAGTACGGC TTCCTCTACA ACCTGATCGG CCCGGACGCC AAGATCGACA CCGGTGGCAA GCGCATCACC GGGTCGCTGG TCGCCCAGGG CAAGGACACG GGTGTCAACG TCGAGGTGCG CTTCGGCGGG CCGGCCATCG GCTTCGAGCA GGTCAGCTCC CAGCTCTACC TTGACCCGGA GATCGACCTG GGCCTGGTCT CCTCGGACGA GGCGATCCAG AACTCCAAGG ACCAGCCGAC CACGGCCGTC TTCGCCCCGT TCGAGGTCAG CCCCATTATG ATCATGTGGG ACAAGGTGAA GAACCCGAAC TTCCACACCC TGGTCGACAT CGGCCAGACC GACACGAAGG TCCTGTACTA CGAGACCGAC ACCTACATGC AGTACCTGCT CGGCGCGGGC ATCCTGCGGG CGTCCCAGGT CGACGGCAGT TACGACGGCG GCCCGTCGCG CTGGGTGACC GAGGACGGCG CCGTCGCGCA GGGCGGGTTC GCCACCTCCG AGCCCTACAT CTACAAGAAC GAGCTGGATG ACGGGCGCAG CTACGACGTC GACCTCCAGC TGATCAACGA CACCGGGTAC CCGGTCTACG GTCAGGCGCT GTCGATCCGC TCCGGTGACA AGGAGACCCT CGCGCCCTGC CTGAAGAAGC TGATCCCGAT CGTCCAGCAG TCGCAGGTCG ACTTCGTGAG CGACCCGGCC GAGACCAACG CCCTGATCAT CAAGGCCGTG CAGGCCGACG ACGTCTCGGT GTGGAACTAC TCGCCGGGCC TGGCCGACTT CGCCGTCACC ACGATGAAGG AGCGCGGCCT GGTTGCCAAC GGGCCGAACG CGGCCGTCGG CGACATGGAG GAGGACCGGC TGGCCCGGAT GATCGAGATC CTCGAGCCGA TCTTCACCGG CCAGCGCAAG GAGCTCAAGG CCGGCCTGGC ACCCGGTGAC CTGTTCACGA ACGAGTTCAT CAACACCTCG ATCGGCCTCA AGTGA
|
Protein sequence | MRRTKLGVVA LGATAALALA ACSGDSTTGE SSSGGERAPA VEAGGALDLA GVCPENVVIQ TDWFAESEYG FLYNLIGPDA KIDTGGKRIT GSLVAQGKDT GVNVEVRFGG PAIGFEQVSS QLYLDPEIDL GLVSSDEAIQ NSKDQPTTAV FAPFEVSPIM IMWDKVKNPN FHTLVDIGQT DTKVLYYETD TYMQYLLGAG ILRASQVDGS YDGGPSRWVT EDGAVAQGGF ATSEPYIYKN ELDDGRSYDV DLQLINDTGY PVYGQALSIR SGDKETLAPC LKKLIPIVQQ SQVDFVSDPA ETNALIIKAV QADDVSVWNY SPGLADFAVT TMKERGLVAN GPNAAVGDME EDRLARMIEI LEPIFTGQRK ELKAGLAPGD LFTNEFINTS IGLK
|
| |