Gene Information |
Locus tag | Cfla_3079 |
Symbol | |
ID | 9146991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 3426258 |
End bp | 3427352 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | von Willebrand factor type A |
Protein accession | YP_003638161 |
Protein GI | 296130911 |
COG category | |
COG ID | |
TIGRFAM ID | |
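
The coordinate and length fields above are related by simple arithmetic. The sketch below is an illustration, not part of the record. It assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record itself):
    - coordinates are 1-based and inclusive, so length = end - start + 1
    - the last codon is a stop codon and is not counted in the protein
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the terminal stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
print(cds_lengths(3426258, 3427352))  # (1095, 364)
```

Under these assumptions the result agrees with the Gene Length (1095 bp) and Protein Length (364 aa) fields.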
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0708582 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000000341622 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGTGGGTG CGGTCGTGAC CGGGCTCGAG GGCGTCGTGC CGGTCGGCGT GGTGCTGGCG ACCGCGCTGC CCGTGCTGGC GCTGTGCGTC GGGCAGGCGC TGGGCCTGGC GCTGGTGCGA CGCGGGCGCG GGCTGCGCGT CACGCGGTCC GACGTGCCGC GCGCGGGTGC GGCCACGTGG GCCCGCCGGG CTGCGCTCGT GGTCGTGCTG GCGGTCGTGG GCCTCACGCC GACGACCGGC GCGACCCAGC AGGGCGGCGC GAGCGTCGGC GTGCAGGTCT GGTTCGTCGT CGACCGCACG GGCTCGATGG CGGCCGAGGA CTGGGGCCCG GGCGACCTGG TGGTCCGCGC TCCCGGGCTC GCGCCGGTGC CCGCGGACCA GCGCCTCGAC GGCGTCCGTC ACGACGTGGT GAGCCTCGCG CGGGACGTGC CCGGCGCGTC GTACACCGTC ATCGCCTTCG CCGACGAGGC GGGCACGCAG CTCCCGCTCA CGCAGGACTC GACGGCCGTG CGCGCCTGGG CGCAGACCGT CACGCAGGAG GTCACCGCCG GGTCCTCCGG CACGCGCCGC GACCGCGCGC TCGACGTGCT CCTGCGGTCC CTGCAGGACG CCCGCGACCT CGACCCCGCC ATGGTCCGGC TGGTGTTCTA CCTGTCCGAC GGCGAGCAGA CGTCCGACGC GGAGGTCGCG TCGTTCGCGC AGGTCGCCGA GCTCGTCGAC GGCGGCGCCG TGCTCGGCTA CGGCACCGCG GAGGGCGCGC CGATGCGCCG GTTCGACGGC ACCGTCGACC CCGACGCCCC GTACATCCCC GACCCCGACG ACCCGTCGCG GCCCGCCCTG TCGTACGCCG ACGAGACGAC GCTCCGCGAG GTCGCCGCCG AGCTCGGTGT GCCGTACGTG CACCGTGACG GCCCGTCGCC GACGGCCGAC CTCGTGGCCG GCCTCGACCC CGAGGCCGTC GCCGCGGACG GGCGCTCGGA CGTGGCCACG CGCCGTCACC TCGTGTGGCC CTTCGCGCTG GTGGCCGGCG TGCTGCTCGC CGCCGAGGCG TGGGCGTGGG CGCGGGCGTC GAGCCGACCG AGGAGGTCAC GGTGA
|
Protein sequence | MVGAVVTGLE GVVPVGVVLA TALPVLALCV GQALGLALVR RGRGLRVTRS DVPRAGAATW ARRAALVVVL AVVGLTPTTG ATQQGGASVG VQVWFVVDRT GSMAAEDWGP GDLVVRAPGL APVPADQRLD GVRHDVVSLA RDVPGASYTV IAFADEAGTQ LPLTQDSTAV RAWAQTVTQE VTAGSSGTRR DRALDVLLRS LQDARDLDPA MVRLVFYLSD GEQTSDAEVA SFAQVAELVD GGAVLGYGTA EGAPMRRFDG TVDPDAPYIP DPDDPSRPAL SYADETTLRE VAAELGVPYV HRDGPSPTAD LVAGLDPEAV AADGRSDVAT RRHLVWPFAL VAGVLLAAEA WAWARASSRP RRSR
|
| |
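
The sequences above are displayed in space-separated blocks of ten characters, and the record lists translation table 11. The sketch below shows one way to strip the display spacing, compute GC content, and translate the gene. It is a minimal illustration under stated assumptions: the helper names are hypothetical, only unambiguous A/C/G/T bases are handled, and the first codon is translated as methionine when it is one of the table 11 start codons (which is why a gene beginning with GTG yields a protein beginning with M).

```python
from itertools import product

# Amino acids in the usual NCBI ordering of codons over the bases T, C, A, G.
# Translation table 11 uses the same codon-to-amino-acid assignments as the
# standard code; it differs in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean_sequence(text: str) -> str:
    """Remove display whitespace and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    seq = clean_sequence(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_table11(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = clean_sequence(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence above, plus a stop codon.
print(translate_table11("GTGGTGGGTG CGGTCGTGAC CTGA"))  # MVGAVVT
```

Passing the full Gene sequence field to `translate_table11` and `gc_content` gives a quick consistency check against the Protein sequence and GC content fields.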