Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3982 |
Symbol | |
ID | 5675726 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4768615 |
End bp | 4770099 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242860 |
Product | hypothetical protein |
Protein accession | YP_001508277 |
Protein GI | 158315769 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.399913 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGACTA GTACCTACCT GCAGGGTCGG GACAGCGACC GGGCAGCCGG ATCTGGCGTC CGGCTCGGAC GACCGGCTCG CCCAGGCCCG CGCGCTGCTG CGGCTCGGCC GGGTGCACCG GATCGACGTA CCGGATGGTT TCCAGGAGAC CAGCCCGGAC GTGCTGGACG CCTTCGCGCG CAAGGACCCC CGGCTGGCCG CCGCCTGCGA GGCGGGCCGG GCCGACCGGG CGATCACCCA GGTGCTGGCC GGCGGGGTGG ACCTGCTCGA GCGCTACACG CACACCCTCG ACGACCAGAG CCGGGCGCTG CTCACCGCCG CGATGGACAT CCGCCGGCTC GGGCGTTCCG GCCCGCTCCC GGGGCGCTGC TCGAGCAGGC GGCGCCCGGC TACCTCGCTC CCGGCCAGCG GGTCGCCGGT CCCGACTGGT TCGCCGCCGC CGTCCGGACC GCGACCGAGG AACTCCGTGG GGTCCGCGCG CTCACCCCGG TCCGGACCGC GCCCGGCGTC GGCGAGCCGG ACGGGTACCT GCTACACGAC TACCTGGAGC AACACGCCGA CGCGGCGCGC CGTGCGAATC CCGTACCCGC GGCCGTCTGG GAAGCACTCA CGGCGCACGC GGCTGATCCG GACGAGCTGG CGCGGCTCGC GGTGGAGGCG GAGTACCGAG GGCTGTACCA GTACGCGCAC CGGCTTGCGG TCGCGGCTGG GGACGACGCC CATGCGCTGG GGATCCTGGC TGACCGGATG AGCGCCGACG GATACGCCGA GGAGGCGCTG GTCGCCTATG AGCAGGCGGC GAGGACCGCC GGTTGGAGCC ATCCGATCGA GTACGCCGCC GATCGCCTAG AGAAGTCGGG ACACGTCGAC GAGGCCATCG CCTGTTACCA CAGGGCAGCG GAGATCTTCG GCAGGAGCGA GCCGCTGCGG CGGGCCGTGC ATCTGCTGGA GAGCGCGGGC CGCCAAGATG AGGCGCTCGA CTGCCGGATG CGGGCGCTCG AGATAGCCGC GAGCGCGCCG AGCCTACGTG CCCGAGCGGA AGAACTGCTT GCCGACGGCC GCACCGACGA GGCGCTGGGA TTGTTCCACC GTGCGGACCA GGCAGATGGT TCGGCGAAGC TGTTCCCGCG AGGTGGCGAT CTACTCGATC CGAGGACACT CGTCACCAAC TATCGGTGGC ATGCGCAGTC CTTCGATCTG CAGCACTGGC AGGACGTCCG TGACGTGGCA GAGCTGCTGG ACACGGCTGG GCGAGCCGCC GATGCGAATA GCTGGTTGCA GGGGCTGGCC GCCGCTTCGA CCACGACGCG GTCGAAGCGG CGTCCGCTCG ACTGGTCGCG GCGGGCCGCC AGGCGGACGC CATCACGTGG CTGCGGGAAC TGGCAGAGCG TGGCAACCGC CCGGCGCGAC GGGAAGCGGC CAAACAACTG CTCGCGGCGG GGCCCTACCT GCCAGGATGA GGTGAGACAG GCTCGGTTCC GGTAG
|
Protein sequence | MPTSTYLQGR DSDRAAGSGV RLGRPARPGP RAAAARPGAP DRRTGWFPGD QPGRAGRLRA QGPPAGRRLR GGPGRPGDHP GAGRRGGPAR ALHAHPRRPE PGAAHRRDGH PPARAFRPAP GALLEQAAPG YLAPGQRVAG PDWFAAAVRT ATEELRGVRA LTPVRTAPGV GEPDGYLLHD YLEQHADAAR RANPVPAAVW EALTAHAADP DELARLAVEA EYRGLYQYAH RLAVAAGDDA HALGILADRM SADGYAEEAL VAYEQAARTA GWSHPIEYAA DRLEKSGHVD EAIACYHRAA EIFGRSEPLR RAVHLLESAG RQDEALDCRM RALEIAASAP SLRARAEELL ADGRTDEALG LFHRADQADG SAKLFPRGGD LLDPRTLVTN YRWHAQSFDL QHWQDVRDVA ELLDTAGRAA DANSWLQGLA AASTTTRSKR RPLDWSRRAA RRTPSRGCGN WQSVATARRD GKRPNNCSRR GPTCQDEVRQ ARFR
|
| |