Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3412 |
Symbol | |
ID | 3905652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4053575 |
End bp | 4054945 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637880735 |
Product | von Willebrand factor, type A |
Protein accession | YP_482495 |
Protein GI | 86742095 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0316076 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGACA GGGACCGGCT CGGCGTCGAG GGCCGGATGG ACCGGGGTGT CACGAAGCCG GGTGGGGTGC TCCGGCTGAC GATCAGCGTG GCCGATCCGC CGCGGGGCCG GCGGGAGGTG GCGGTCGCCG TCGCGGTCGA CGCATCCCTG TCGATGCTGC ATCCGGCGGC GCCGGACGCG GTCAGCAAGT GGGAGCTCGC GCAGCGGGTT CTGGAACGCG TGCACGGCCT GCTGCCCGAC GGCTGCCCGC TCGTGCTGGT GCGCTTCGCC CGCGAGGCGC GAGTGATCAG CGCCGGACGT CGGCCCGACC GCCTGCCGGC CGTTGACGGC CCCACGAACA GCGACGCGGA CTCCTACACC AACATCGGCG ACGCGCTCGC CGTCGCCGGG CGGGAGCTGC TCGCGCAGGC GCCGGACGCG GACGTCTACC GGGTCCTGCT CATCACCGAC GGTGAGGCCA ACATCGGCCG CTGGCAGCCC GAGCAGCTCG CCCACATCGT CGCCGGGCTC GGCGACCAGG GCATCGGGAC GGACGCGCTG GGCATCGGCG TCGACGCCCG GGACGAGGTG CTCGCCGAGA TGGTCGGGGT GTGCGGCCAG ACGCAGCACG TCTGGGCGGC GGCCGAGGAC GTCGACCGGA TCGACGCGAT CGTCTCGTCG CTGATCGAAC CGGTGGTCGG CGCGGCCGCG GGGCGCGGTG AGCTGCGGGT CCTGATCCGA CCCGGCTGGA CGGTGGAGGC GCTGCGGCGG ATCAAACCGC AGCGTCACCT CATGAAGGTG CCGCCACCGA CCTCGCGCGG CACCGAGCTG CGGCTGCCGC TACCCGCCGT GAGCACCGGC GAGGACCGTC CGGTCTTCCT GCTCCGGTTG CGCGCTCCGG ACACACCGCT CGGCGCGCAC ACGATTCTTC AGGCCCGCGG CGGGGTGGTC GCCGGTGGAC GGCGGCTGGC TGTGGACAGC CGGGCCGACC CGCACGCCGC GCAGCGCTTC CGGGCCACCG TCGAGGCGGA TGTCTTCCCG GACGCGGAAT CCTCCCTCGA ACACGAGGAG AGCATCGCGG AGTTCGACGA GGAGATTGCG AAGCGGGCCA GCTTCGCCCA TTCCCGGGAG AGCGTCACCG AGCTGTTCCG GAACGCGGCG CTGTGGGCGG CGGACCGCGG ATTCGACGAT CTGGCCGACC ACTACAACGC GACGCTACGG GCCCTCGGCC AGGGTGTCGC GCCGGCCGAC GCGGTGTCGG GAGCCCGGGT CCGGGCGACC CGCACCAGGA CCCGGACCCG TGATCTCGTC GAGGCGGACC CCGTGTCCGG CCGGCAACAC GGCGGGGCGG CGTCCCGTCG GCGTCTCGAC GACGTCCTCG GCAACAGCTG A
|
Protein sequence | MNDRDRLGVE GRMDRGVTKP GGVLRLTISV ADPPRGRREV AVAVAVDASL SMLHPAAPDA VSKWELAQRV LERVHGLLPD GCPLVLVRFA REARVISAGR RPDRLPAVDG PTNSDADSYT NIGDALAVAG RELLAQAPDA DVYRVLLITD GEANIGRWQP EQLAHIVAGL GDQGIGTDAL GIGVDARDEV LAEMVGVCGQ TQHVWAAAED VDRIDAIVSS LIEPVVGAAA GRGELRVLIR PGWTVEALRR IKPQRHLMKV PPPTSRGTEL RLPLPAVSTG EDRPVFLLRL RAPDTPLGAH TILQARGGVV AGGRRLAVDS RADPHAAQRF RATVEADVFP DAESSLEHEE SIAEFDEEIA KRASFAHSRE SVTELFRNAA LWAADRGFDD LADHYNATLR ALGQGVAPAD AVSGARVRAT RTRTRTRDLV EADPVSGRQH GGAASRRRLD DVLGNS
|
| |