Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0045 |
Symbol | |
ID | 7407280 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 58232 |
End bp | 60940 |
Gene Length | 2709 bp |
Protein Length | 902 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643714455 |
Product | von Willebrand factor type A |
Protein accession | YP_002571980 |
Protein GI | 222528098 |
COG category | [R] General function prediction only |
COG ID | [COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATAG AGTTTGAAAG ACCTTTTATT TTACTTGCTG CAGCTGTGCT TGGAGTATTT ATTTGGCTTG TTTCAAGAAG ATTTTCAAAA GAAAGTTTGG GTAAAAGATT TGTTGTATGG GTGAGGATTG TTTTAATAAC GCTCATAATC CTGGCTCTAA GCGTGCCAAG CTTGGCAATT TCAACCGACA AAATAACAAC AATTTATCTT GCGGATATGT CAGAGAGCAA TAGAAAGAAC GCAGAAAAAA TGAAAGACTT TATTCAAAGG TCGATAAAGC TCAAAAAATC AAATGAATTG CAGTCAGTTG TTGTGTTCGG GCAGGATGCA AACATTGAGT TTACACCGAC TAAGTATCCT AACTTTTCTG AATTTGGAAC GGCTGTAGAT AGCACTCAAA CCAATATTGA AAATGCAATA AAGTATGCAG TAAATCTATT TGACAAGGAT CAACAGAAAA GACTTGTCAT TCTAACAGAT GGGAAAGAGA CAATAGGTGA GGCAAAAAAT GAAGTAGAAC TTCTCAAAAA TAATGGAATA GATGTAAAAG TAGTGCTATT TAAAAGTGAG AAAGAAAAAG ATGTTCAATT TTCAAGTTTG AAGATTCCGC AGAAGGTATT TAAAAATCAG GCGTTTTCGA TATTTGCCAA TATAAACACC ACTTTTTCAA CAAAAGCTCT GCTCAAGATT TTCAGAGACA ATGATTTAAT TTTCAGCCAG CAGGTGACGC TTGAAAAAGG TGAAAACAGG TTTGTGATAA AGGATAAACT GGACAAGGAA GGGGTTTTCA GCTATCAAGG AGAAGTTGAA GCGATAGATG ATGAAGAAGA GTCAAACAAT GTTGCGTATG CTATGTGTCA GGTAAGAAAA CCTCAAAAGG TTCTGGTAGT TTATGAGGAT ATTGATGATG TGAAAAATGT CAAAAATCTT CTTGATTCTT ATTCAGCCGA TTATGATGCA GTAAAGAGCG ACCAGGCAAA CTTTGGGCTT GACAAACTCT TAGGGTATTC TTTTGTGATT TTGTGCAATG TGTCAAGAGA AAGCTTTAGC GATGAGTTTT TAAGCTCTGT AGAGAAGTAT GTGAAGGATT TAGGTGGTGG ACTTTTGGTA ATTGGCGGCA CAAATTCATA TGCGCTTGGG AATTATTCTA ATTCGGTTTT AGAAAAGATG CTGCCTGTCA AGATGGAGAT TAAGAACAAA GAAAAGGAAA AGAACATAGA TGTTGTACTT GTCCTTGACC ATTCAGGCAG CATGGCAGAT ACAGAAGACG CAGGTATTCC AAAATTAGAG ATTGCCAAGA GTGCTTCTGC AAAGATGATT GAGCACCTTG AAAGTTCAGA TGGTGTTGGT GTGATTGCTT TTGACCACAA TTATTATTGG GCATACAAAT TTGGTAAGAT AAGCAAAAAA GAAGATGTGA TAGAAAGCAT ATCAAGCATC GAAGTAGGTG GTGGGACGGC TATAATTCCA CCCTTGAGTG AAGCAGTTAA AACTCTAAAA AAGTCAAAGG CAAAAAGCAA GTTGATTGTG CTTCTGACTG ATGGCATGGG TGAACAAGGC GGTTATGAAA TTCCAGCCAA TGAGGCAAAA AGAAATAACA TCAAAATCAC CACAATTGGT GTTGGAAAGT ATGTAAACGC TACAGTTTTG AGTTGGATAG CCTCTTTTAC CTCAGGCAGG TTCTATTTAG TTTCCAATCC TTCTGAGCTT GTTGATGTGT TTTTAAAAGA GACAAAAATT ATAAAAGGCA AGTACATAAA GGAAAAGAAG TTTGTCCCCA AAGTGGTTGA GACAAATTCA ATAAACTCAG GTTTTTCCTC TTATCCACCG CTTTACGGTT ATATCGCAAC AACAAAAAAA GACCTTGCAA CCAGTCTTTT GGTAAGCGAC GAGGACGAGC CTATTTTGAC AGTGTGGAGG TACGGGCTTG GAAAGGTTGC TGCATGGTGT TCGGACTTGA GCGGCCAGTG GTCGCGTGAC TGGGTTTTGT GGGATAGATT TTCACAGTTT TGGACAAAGC TTTTCAAGTG GTTGGAAAAA GGGGCAGATG ACAGCAGCTA TGATTTTAAT GTTCGAAAAG AAAAAGATTT GGTGGTATCT TTGGTAGGCA AGTTCGATGC TGATACCATT GCAACTCTTA AATGCATGTA TCCAAACGGC AAAGAAAAGA CAATTGCAAT GAGAAGAACT GCACCAGATA GGTTCGAATC TAAAGTTGAT TTTATGCTGG GAAATTATGT ATTTGTGGCA ACCTTAAGTA ATAAGGACAA ATCCAAGGTA TCAACGTTTT TCTATTCAGC TAACTACTCT GATGAGTTTA GAGTGGATAT AGACAGCAGC AGGTTTGAAC AGTTTATCTC CTTATCAGGT GCAAAAGTTA TAAAAAATCC CAATGAGGTG TATGCGGGAA AGCTCAAGAG CACAGAAGAA AAGAGGGATA TTAGCAGTTT ATTAATAGTT CTTTGTATTG TATTATTCCT TTTGGAAGTG AGTATTAGAA GATTTGGTCT TTATCCTCAG GTGGAAAGGA GTTTTTTGGC AATATCTTCA GGCTTTAAAA GGATTGCAAA GCCACATTCA CTTAAAAGGG TATGGGATAA AGTATCTTCT TTGAAGAAGA AGAAGAAGAA TACTTATGCG AAAAATTCAA AGGAAGCAGA GGTAGTATCT AATGATACTT TGGATGTTAA AAAACTGAGA AGATTTTAA
|
Protein sequence | MRIEFERPFI LLAAAVLGVF IWLVSRRFSK ESLGKRFVVW VRIVLITLII LALSVPSLAI STDKITTIYL ADMSESNRKN AEKMKDFIQR SIKLKKSNEL QSVVVFGQDA NIEFTPTKYP NFSEFGTAVD STQTNIENAI KYAVNLFDKD QQKRLVILTD GKETIGEAKN EVELLKNNGI DVKVVLFKSE KEKDVQFSSL KIPQKVFKNQ AFSIFANINT TFSTKALLKI FRDNDLIFSQ QVTLEKGENR FVIKDKLDKE GVFSYQGEVE AIDDEEESNN VAYAMCQVRK PQKVLVVYED IDDVKNVKNL LDSYSADYDA VKSDQANFGL DKLLGYSFVI LCNVSRESFS DEFLSSVEKY VKDLGGGLLV IGGTNSYALG NYSNSVLEKM LPVKMEIKNK EKEKNIDVVL VLDHSGSMAD TEDAGIPKLE IAKSASAKMI EHLESSDGVG VIAFDHNYYW AYKFGKISKK EDVIESISSI EVGGGTAIIP PLSEAVKTLK KSKAKSKLIV LLTDGMGEQG GYEIPANEAK RNNIKITTIG VGKYVNATVL SWIASFTSGR FYLVSNPSEL VDVFLKETKI IKGKYIKEKK FVPKVVETNS INSGFSSYPP LYGYIATTKK DLATSLLVSD EDEPILTVWR YGLGKVAAWC SDLSGQWSRD WVLWDRFSQF WTKLFKWLEK GADDSSYDFN VRKEKDLVVS LVGKFDADTI ATLKCMYPNG KEKTIAMRRT APDRFESKVD FMLGNYVFVA TLSNKDKSKV STFFYSANYS DEFRVDIDSS RFEQFISLSG AKVIKNPNEV YAGKLKSTEE KRDISSLLIV LCIVLFLLEV SIRRFGLYPQ VERSFLAISS GFKRIAKPHS LKRVWDKVSS LKKKKKNTYA KNSKEAEVVS NDTLDVKKLR RF
|
| |