Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0635 |
Symbol | |
ID | 5669052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 737094 |
End bp | 738779 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641239562 |
Product | hypothetical protein |
Protein accession | YP_001505000 |
Protein GI | 158312492 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000516706 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.674007 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGAGC CGACCCCCCG GACGCTGCCG GGGTTCACAC CCGCCGCATG GGGACTGCTC GCCGCGACGG CCGCCTGCGC CCTCGGGGCG GTGCTCCTGC GCTACGCGGA GCTCGCCGCC TTCGCCGGTG CCGGCGCGGC GGCGCTGTTC ACCGCGGTGG CAGCGGTGGC CCGTCCGCCG CGGGTCACCG TCACGACACG GGTCACCCCT GCCGCGGTCA CCCGCGGGGA CGACGCCGCC CTGGTCATCT CGATCGTGAA CCACTCCCGA TGGACGTCTC CCTCGTTCGC CCTACGCCTG CCAGCCGAGC CGGCCCAGCC GGGCGAGGTG CCCGCCGAGC CCGCCGAGCC CGCCGAGCCC GGCGGGTCTG TCGGGTCCGA CGGGCCCGAG ATCGCCGTGG ACATCCGCCC GTTGCGCGGT GGCGCCAGCC GGGAGATCGT CCTGCCGCTC GACACCGCGG CTCGGGGAGT TCGCCGGATC GGGCCCCCGC AGGTGCACCG GTCTGATCCG TTCGGGCTGG CCCACCGCCA CCAGTACCTC GGCACGAGCC TCACCCTGCG GGTCCGCCCC CGCGCATACC CGCTGGTCCC ACCGCCGGCC GCCCCGGCCC GTGACCCCGA CGGGCAGAGC GGACGTGGCG CGTCCGGGGG GCTGATGTTC CACACCCTGC GCGAGTACAC CCCCGGGGAG GACCTGCGCC TGGTGCACTG GGCCGCCAGC GCGCGCACGG GCACGCTGAT GGTCCGAACG CACCTGGACC CCAGTGAGCC CGCCTCCACC GTCGTGCTGG ACACCCGCCG GCGGGCCTAC CCGCCCGGCC CGGTCGGGGC GGCCGTCTTC GAGGACGCCG TCGACGTCGC CGCGTCCGCG GTGCTCGCCT GCGCCCGCAA CTCCTACGGC GTCCGCCTGG TCACCTCGGG CGGGGTGCGG ATGACCGGTC GCCGACGCTC CACCGACGCC GAGTCCCTCC TCGACGAGCT GGCCGACGTC CGGCCGGACG AGGGCGTGAC CCTGGATGTC CTGCGTACCC TGCGCCGCGG GCCCGTCGGC ACCCTCGTCC TGGTGACCGG CGCGCTCGAC CGGGACGCGG CGGCGGCGCT CGCCCCGGTG GCCCACGTGT TCGGCCAGGT GATCGTGCTT CGGATGGGCC CGCGCAGCGA GGCCGCCGCG CTGGCCCGGG GCCGGCGCGC CCTCGGCGAC CGCCCACGCC TGCGCCCGAG TGCGGAGGCG GTGGCACGGG CCCGCGCGGA ACGAGGCGGC CCGGTCGCCC CCGCCGGGCC CACCCGCTCG ACGGGACCGG CGGGCGCGGC CCGTACGGCA GGTGCGACCC GCATGGCCGG TGTGGCCGGT GCCGCTGGTG TGGCGGGGTC GGTTGGCATG GTCGGCACCG CCGGTTCGGG CCGGGTGCGG ATGATCCACC TGGGCTCACC CGCCGACCTG ACCGACGTCT GGCCTGCCGC GCCCCTCCCG CCCCGGGCGC CGGCCGCGGA CCAGGCGGCC GCGAGTCCAG CGGCAGTGGG CTCGGCGGGA TCGGGTTCGG CGGGATCGGG TTCAGGGGGA TGGGGTCAGG CCGAAACCGA GCCGGACCTG GCGGGACCGC TGCTACCGGT GGCCTCCCTG GCGTCGGCCG GCCCCGGCCC CGGCCCCAGG CCCAGGACTG GGCGCGGCGC GGGCGGCGGA TCGTGA
|
Protein sequence | MSEPTPRTLP GFTPAAWGLL AATAACALGA VLLRYAELAA FAGAGAAALF TAVAAVARPP RVTVTTRVTP AAVTRGDDAA LVISIVNHSR WTSPSFALRL PAEPAQPGEV PAEPAEPAEP GGSVGSDGPE IAVDIRPLRG GASREIVLPL DTAARGVRRI GPPQVHRSDP FGLAHRHQYL GTSLTLRVRP RAYPLVPPPA APARDPDGQS GRGASGGLMF HTLREYTPGE DLRLVHWAAS ARTGTLMVRT HLDPSEPAST VVLDTRRRAY PPGPVGAAVF EDAVDVAASA VLACARNSYG VRLVTSGGVR MTGRRRSTDA ESLLDELADV RPDEGVTLDV LRTLRRGPVG TLVLVTGALD RDAAAALAPV AHVFGQVIVL RMGPRSEAAA LARGRRALGD RPRLRPSAEA VARARAERGG PVAPAGPTRS TGPAGAARTA GATRMAGVAG AAGVAGSVGM VGTAGSGRVR MIHLGSPADL TDVWPAAPLP PRAPAADQAA ASPAAVGSAG SGSAGSGSGG WGQAETEPDL AGPLLPVASL ASAGPGPGPR PRTGRGAGGG S
|
| |