Gene Information |
Locus tag | Athe_0044 |
Symbol | |
ID | 7407279 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 56428 |
End bp | 58248 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643714454 |
Product | von Willebrand factor type A |
Protein accession | YP_002571979 |
Protein GI | 222528097 |
COG category | |
COG ID | |
TIGRFAM ID | |
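
The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in Protein Length. The variable names are illustrative only; the values are copied from the record.

```python
# Values copied from the Gene Information table.
start_bp = 56428
end_bp = 58248
gene_length_bp = 1821
protein_length_aa = 606

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
computed_gene_length = end_bp - start_bp + 1

# The gene is not spliced, so the CDS is read as consecutive 3 bp codons.
# Assumption: the last codon is a stop codon and adds no residue.
computed_protein_length = computed_gene_length // 3 - 1

print("gene length matches:", computed_gene_length == gene_length_bp)
print("protein length matches:", computed_protein_length == protein_length_aa)
```

Under these assumptions both checks agree with the reported 1821 bp and 606 aa.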
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTTTT CACAGCCACT TTTTCTTTTG TTTCTTTTGA CAATCCCGAT ATTAATTTTT TTGTATATGA TAAAACCAAG GCACAAGAAA GTAACCATTC CAAGCACGTT TTTATGGGAG CGATTGAAAA AACAAAAAAG AGTTGTAAAG CCGGCGCAAA AGTTAAGATT CAGCCTGCTT TTACTTTTGG GGATTTTTGC ACTTTTCAGT TTTTCAATGT ATCTTGCAGC TCCTAATATA TTTATAAAAA ATACCCGCAA AAATGTAGTT TTTGCCTTTG ATAATGGTGC TTCTATGAGC TGTGTGGCAG ATGATTCTAA GTCATATCTT CAAAAGTCAA AAGAGATAGC AAGAGAGATA ATAGATACAC TGCCGGGCAC AACAAAGGTC AGCATTGTGA CTTTTTCAGA CACAGTCGAG ATTTTGCAAA AAGAAGGCAC GCGCAGCTTT GCCAAGGATG CGATTGACAG GATAGTTCAG ACGTACTATG TAACAGATGT GAAAAAGAGC TTGAATATTA TTTCAAAGCT TTTCAAACCC TCAGATACAA CTTTGTTTAT ATTTACCGAC AAGAATATGC CTCACTTAGA AAATGCAAAA CTTTACAAGT TTCCAAAACC AAAAGATAAT GTTAGCATTG AAAATATATC TTGTGTCTCA AGAAAAGATG GTTTTGATGC TCTGGTAGAA ATTAAAAACA GAGGAATTCA AAAAGCGAGT TTTGATCTTG AACTTTTTGC AGACAGCAGA CTGGTAGGAT TGAAAAGCAT TTCCCTTTTG CCAGGGAAAA TGGGTACTTT TTTGTTTGAA AATATCAAAG GAAATTACAA GGTTGTGTGG GGGAGAATAA ATTATCCAGA CGAAGTAACA AAAGACAATG TCTTTTGGAC AGTTTTAGAG CCCTTGATGG CAAAACAAAA AGTTTTGTAT GTAGGAAAAG GAAATTTCTT TTTTGAAAAA GTGTGGCTTA CTTTTGATGA CGTTGAATTT TATAAAACTC AAGATGTCAA AAACATTGCT GGTGATTTTG ATATATACAT ATTCGACCAG TGCATACCAC AAAAGTTTCC TCAAAAAGGA GCTTTCATAT TTGTATTGCC AAAGAACAGC AGTGCCTTGA AGCTTTTAGG GATAAAGATA GGGAAAAATG CAAGCAATGA AGGATATGCA AGGTTTGTAA AGTCAGCTAT TTCACAAAAC ATTATTGGGA TGGATTTTGC TGTGCAAAAG GCTGTGTATA TTGATGATAG TGCTTTTGAG CCAGTTGCAA AGATAGGTGG CAAGCCAATA ATCTCTTTTG GCAGTATCCA TAATCATCCG TCAATCCTAT TTGGTTTTGC GATAGATAAC AGTGACCTGC CTTTGAAGGT ATCTTTTCCA ATTTTGATGG CAAACATTAA AAGTGCTTTT GTCAAGAAAA ATCAGTTATT TGAAAAGACA GCTTTTTATC CTGGTGAAGA GATAAGAGTT TTTTCATATT CAGATAAAAA AGCAGATTTG GTACTACCCG ATGGGAAAGA AGAAATTGTT GATTTGTCAG GTTATCCCTC AATTCTTCCT AAAAAAGATG TGTTGGGAGT TTATTCTTTG GTTCTAAATG AAAAGTCAAA AGAGAATTAC AAATTTGCAA TAAATTTCCC AACCTATGCT CTGGATGATT CGGGCAGTAG AAATTTGGCA GAAAATAATA CCAACTTTCC AGATGCAGGT AAAATCAATA GCGTGAAAAT GCCATATTCT TTGAAAGATA TATTTTTGAT ACTGGCACTT ATTTTTTTGA CTTTGGAGTG GATGGTGTTT 
TTAAATGAGA ATAGAGTTTG A
|
Protein sequence | MNFSQPLFLL FLLTIPILIF LYMIKPRHKK VTIPSTFLWE RLKKQKRVVK PAQKLRFSLL LLLGIFALFS FSMYLAAPNI FIKNTRKNVV FAFDNGASMS CVADDSKSYL QKSKEIAREI IDTLPGTTKV SIVTFSDTVE ILQKEGTRSF AKDAIDRIVQ TYYVTDVKKS LNIISKLFKP SDTTLFIFTD KNMPHLENAK LYKFPKPKDN VSIENISCVS RKDGFDALVE IKNRGIQKAS FDLELFADSR LVGLKSISLL PGKMGTFLFE NIKGNYKVVW GRINYPDEVT KDNVFWTVLE PLMAKQKVLY VGKGNFFFEK VWLTFDDVEF YKTQDVKNIA GDFDIYIFDQ CIPQKFPQKG AFIFVLPKNS SALKLLGIKI GKNASNEGYA RFVKSAISQN IIGMDFAVQK AVYIDDSAFE PVAKIGGKPI ISFGSIHNHP SILFGFAIDN SDLPLKVSFP ILMANIKSAF VKKNQLFEKT AFYPGEEIRV FSYSDKKADL VLPDGKEEIV DLSGYPSILP KKDVLGVYSL VLNEKSKENY KFAINFPTYA LDDSGSRNLA ENNTNFPDAG KINSVKMPYS LKDIFLILAL IFLTLEWMVF LNENRV
|