Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0368 |
Symbol | |
ID | 5668792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 437566 |
End bp | 438852 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641239300 |
Product | von Willebrand factor type A |
Protein accession | YP_001504740 |
Protein GI | 158312232 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00177271 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000715031 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGCGCGT TCACGGCGAA GGTGTACCAG AACGAGTTCC TGCCGGTCGG CGGGACCCAG GTACACGCGG TGATCACGGT GACCTCGACG GGCGCCCCGG CCGCCCCCCC GATCGCGGGC CGGCCGACCG GGCGTCCGGA GCAGGCCCTG GTCATCCTGC TCGACTGCTC CGGCTCGATG GCGAACCCGC CCGCGAAGGT CACCCAGGCC CGCCGCGCCG TCCGGGCGGC GCTCGACAGC CTGCCCGACG GGGCGTGGTT CGCGGTGGTG CGCGGCACCG GCTCCGCCGC GATGGCGTAC CCGCGCTCGC CCGAGCTGGT TCCGGCGTCC GCGGCGACGC GGGCGGCGGC CTGCCACGTG GTGGACGCGC TCGAGCCGCA CGGCGGCACC GCCATGGGCC GCTGGCTGCG GCTGGCGAAC GACCTGCTGG CGACCCGGCC CGACGCCATC GGCCACGCGC TGCTGCTCAC CGACGGGCAG AACGGCGAGA TGGAATCCGA GCTGCTCGGC GCCGTCGACG CCTGCCAGGG CCGGTTCCAG TGCGATTGTC GAGGGGTGGG CACCGACTGG CGGGTGGAGG AGCTGCGGGC GATCGCCACC GGCATGCTGG GGACGGTGGA CGCCGTCCCC GAGCCCGCCG GCCTCGCGGC CGAGTTCGAG CGGATCGTGG CCACCGCCCT CGACCGGGCC ACCGACCGGG TCTCGCTGCG GCTGTGGACG CCCACCGGTG CCTCGCTGGA CTTCCTGCGC GAGGTCACCC CGGACCTGCG GGACCTCACC GGCTCCGGCC GGGTCGTCGA CGACCACTGC ACGGACTACC AGACCGGGGC CTGGGGGATC GAGTCCCGGG ATTACCACCT GTGCGTCCGC CTGCCGGCCC GCGAGGTCGG CACCGAGGTC CTGGCGGCCC GCGTAAGCCT CGTCGTCGAC CACCAGCCGA CGTCATCCGC GCTGGTACGC GCACTGTGGA CGGATGACAC GGCACTGGCC ACCCGGGTCA ACACGGAGGT TGCGCATTAC ACCGGTCAGG CCGAGCTGGC CAGGGTCCTG GCGGACGGGC TGGAGGCCCG TCAACAGGGA GACGACGTCA CTGCCACACT GAGGCTTGGG CGGGGAGTAC AGATCGCGCT CGCGTCAGGC AACGAAGCAA CCTACCGCCT CCTACAGAAG GTGGTTCACA TCGACGATCC CACAACGGGT ACGGTTCGTC TGAAGAAAAA CGTCGAGAAG ATGGACGAGA TGGTCCTCGA CTCGAGGTCA ACCAGGACTG TCCGGGTGAA CAGGTGA
|
Protein sequence | MSAFTAKVYQ NEFLPVGGTQ VHAVITVTST GAPAAPPIAG RPTGRPEQAL VILLDCSGSM ANPPAKVTQA RRAVRAALDS LPDGAWFAVV RGTGSAAMAY PRSPELVPAS AATRAAACHV VDALEPHGGT AMGRWLRLAN DLLATRPDAI GHALLLTDGQ NGEMESELLG AVDACQGRFQ CDCRGVGTDW RVEELRAIAT GMLGTVDAVP EPAGLAAEFE RIVATALDRA TDRVSLRLWT PTGASLDFLR EVTPDLRDLT GSGRVVDDHC TDYQTGAWGI ESRDYHLCVR LPAREVGTEV LAARVSLVVD HQPTSSALVR ALWTDDTALA TRVNTEVAHY TGQAELARVL ADGLEARQQG DDVTATLRLG RGVQIALASG NEATYRLLQK VVHIDDPTTG TVRLKKNVEK MDEMVLDSRS TRTVRVNR
|
| |