Gene Information |
Locus tag | Franean1_0147 |
Symbol | |
ID | 5668572 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 176507 |
End bp | 177967 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641239076 |
Product | ExsB family protein |
Protein accession | YP_001504520 |
Protein GI | 158312012 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases; [COG1606] ATP-utilizing enzymes of the PP-loop superfamily |
TIGRFAM ID | [TIGR00268] conserved hypothetical protein TIGR00268 |
|
|
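The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names `span_length` and `expected_protein_length` are illustrative and do not come from the database.

```python
# Values copied from the Gene Information table above.
START_BP = 176507
END_BP = 177967
GENE_LENGTH_BP = 1461
PROTEIN_LENGTH_AA = 486


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons print True when the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the record is consistent: the coordinate span equals the stated gene length, and the stated protein length equals the codon count minus the stop codon.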
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00645126 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTCCCCCT GGAGAACATC CACCCTGGTC GTCGGTTTCG ACCTGGACAT GACCCTGCTG GACGCCCGCC GCGGGATCGT CGCCACCTTC GCCGAGCTGG CCGCCGAGAC CGGCGTCACG ATCGACGGCG AGGCCGCCGT GCGCCGTCTC GGCCCCCCGC TCGAGGACGA GATCGCCCGC TGGTTCCCGG CGGACGAGGT GGCCCGGCGG GCCGCCCGCT ATCGCGAGCT CTACGCGGTG CACGCGGTGC CCGTCTCCGT CGCGATGCCG CACGCCGCCG AGGCCGTCGA GGCGGTCCGC AGCGCCGGTG GGCGGGTCGT GGTCGTGACC GCGAAGAGCG AGCCGCTGGC CCGGGCCAGC CTGGAGCACA TCGGAATCAC AGCGGACCAC GTCGCGGGCT GGCTGTTCGC CGAGACCAAG GGCGCGGCGA TCGAGGAGCA CGGCGTGGAC GTCTTCGTCG GCGACCACGT CGGGGACGTC CACGGCGCGC GCGCGGGCGG AGCGGCCTCC GTCGCGGTGC CCACCGGCCC CTGCCCGCCG GACGACCTGA CCGCGGCCGG GGCGGACGTC GTCCTGCCCG ACCTGGCGGC CTTCCCGGCC TGGCTGGCGG ACGAGGTGCT CCGCCGGCGC CTGGACACGC TCAGCGCACG GCTGGGTGAG CTCGGATCGG TGCTGGTCGC CTTCTCGGGC GGCGCGGACT CCGCCTTCCT GCTGGCCGCC GCCGCGCGGG AACTCGGCCC CGATGCCGTG GTCGCGGCCA CGGCCGTGTC GGCGTCCCTG CCCGCGGCCG AACTCGACGC GGCCCGCCGC TTCGCCACTG GCCTCGGGGT GCGCCACCTG TTCCCGGCGA CCGACGAGAT GAGCCACGAG GGCTACCGCG CCAACAGCTC CAACCGTTGC TACTTCTGCA AGTCCGAGCT CGTCGACACA CTCGCGCCGC TCGCCGCCGA GCTGGGCCTG GCGCACGTGG TCACCGGCAC CAACGCCGAC GACGCCCGTG CGGGGTTCCG GCCCGGCATC GGCGCGGCGG CCAGCCGGGG CGCGCGCACG CCGCTGCTGG ACGCCGGTCT GACCAAGGCC CAGGTGCGCG CCGCCTCCCG CACCTGGGGG CTGCCGACCT GGGACAAGCC GGCGGCGGCC TGCCTGGCGA GCCGGATCGC CTACGGGGTG CGGGTCAGCC CGGCCCGGCT CGCCCGGGTG GAGCGTGCCG AGACGGCGCT GCGGGTGGCC ACGGCAGCGG CCGGCCTGCA CATCCGCGAC CTGCGGGTGC GCGATCTCGG GGACGTCGCC CGGATCGAGG TGGATGCCGA CCACGTGGCC GGACTGGTGG CCCGTCCTGA TCTGGTCTCG GTGGTCGTCG AGTCCGGTTT CGCCCGCGCG GAGGTCGATC CCCGGGGCTT CCGGTCCGGC TCGATGAACG AGCTCCTTCC CGCCCCCGGG CGCCAGACCG AGCCGGCCTG A
|
Protein sequence | MSPWRTSTLV VGFDLDMTLL DARRGIVATF AELAAETGVT IDGEAAVRRL GPPLEDEIAR WFPADEVARR AARYRELYAV HAVPVSVAMP HAAEAVEAVR SAGGRVVVVT AKSEPLARAS LEHIGITADH VAGWLFAETK GAAIEEHGVD VFVGDHVGDV HGARAGGAAS VAVPTGPCPP DDLTAAGADV VLPDLAAFPA WLADEVLRRR LDTLSARLGE LGSVLVAFSG GADSAFLLAA AARELGPDAV VAATAVSASL PAAELDAARR FATGLGVRHL FPATDEMSHE GYRANSSNRC YFCKSELVDT LAPLAAELGL AHVVTGTNAD DARAGFRPGI GAAASRGART PLLDAGLTKA QVRAASRTWG LPTWDKPAAA CLASRIAYGV RVSPARLARV ERAETALRVA TAAAGLHIRD LRVRDLGDVA RIEVDADHVA GLVARPDLVS VVVESGFARA EVDPRGFRSG SMNELLPAPG RQTEPA
|
| |
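The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that spacing, compute a GC fraction, and translate a nucleotide fragment. It is illustrative only and makes these assumptions: the codon assignments are those of the standard genetic code (which translation table 11 shares), an initial ATG, GTG, or TTG is read as methionine (a common subset of the table 11 start codons, not the full set), and the listed gene sequence is already in coding orientation. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only short fragments copied from the record are used.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard codon assignments, built in TCAG order.
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)
# Assumption: a common subset of bacterial start codons, not the full list.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding-strand fragment, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    head = "GTGTCCCCCT GGAGA"               # first 15 bases of the gene sequence
    tail = "C GCCAGACCG AGCCGGCCTG A"       # last 21 bases of the gene sequence
    print(translate(head))                  # compare with the protein N-terminus
    print(translate(tail))                  # compare with the protein C-terminus
    print(gc_fraction("GTGTCCCCCT"))        # GC fraction of the first block only
```

Running `gc_fraction` and `translate` on the complete gene sequence gives values that can be compared with the GC content and protein sequence fields of the record.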