Gene Athe_0045 details


Gene Information

Locus tag: Athe_0045
Symbol:
ID: 7407280
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 58232
End bp: 60940
Gene Length: 2709 bp
Protein Length: 902 aa
Translation table: 11
GC content: 37%
IMG OID: 643714455
Product: von Willebrand factor type A
Protein accession: YP_002571980
Protein GI: 222528098
COG category: [R] General function prediction only
COG ID: [COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
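
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names `gene_length` and `protein_length` are hypothetical, and it assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(58232, 60940)
print(length)                  # 2709
print(protein_length(length))  # 902
```

Both values match the Gene Length and Protein Length fields of this record.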


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAATAG AGTTTGAAAG ACCTTTTATT TTACTTGCTG CAGCTGTGCT TGGAGTATTT 
ATTTGGCTTG TTTCAAGAAG ATTTTCAAAA GAAAGTTTGG GTAAAAGATT TGTTGTATGG
GTGAGGATTG TTTTAATAAC GCTCATAATC CTGGCTCTAA GCGTGCCAAG CTTGGCAATT
TCAACCGACA AAATAACAAC AATTTATCTT GCGGATATGT CAGAGAGCAA TAGAAAGAAC
GCAGAAAAAA TGAAAGACTT TATTCAAAGG TCGATAAAGC TCAAAAAATC AAATGAATTG
CAGTCAGTTG TTGTGTTCGG GCAGGATGCA AACATTGAGT TTACACCGAC TAAGTATCCT
AACTTTTCTG AATTTGGAAC GGCTGTAGAT AGCACTCAAA CCAATATTGA AAATGCAATA
AAGTATGCAG TAAATCTATT TGACAAGGAT CAACAGAAAA GACTTGTCAT TCTAACAGAT
GGGAAAGAGA CAATAGGTGA GGCAAAAAAT GAAGTAGAAC TTCTCAAAAA TAATGGAATA
GATGTAAAAG TAGTGCTATT TAAAAGTGAG AAAGAAAAAG ATGTTCAATT TTCAAGTTTG
AAGATTCCGC AGAAGGTATT TAAAAATCAG GCGTTTTCGA TATTTGCCAA TATAAACACC
ACTTTTTCAA CAAAAGCTCT GCTCAAGATT TTCAGAGACA ATGATTTAAT TTTCAGCCAG
CAGGTGACGC TTGAAAAAGG TGAAAACAGG TTTGTGATAA AGGATAAACT GGACAAGGAA
GGGGTTTTCA GCTATCAAGG AGAAGTTGAA GCGATAGATG ATGAAGAAGA GTCAAACAAT
GTTGCGTATG CTATGTGTCA GGTAAGAAAA CCTCAAAAGG TTCTGGTAGT TTATGAGGAT
ATTGATGATG TGAAAAATGT CAAAAATCTT CTTGATTCTT ATTCAGCCGA TTATGATGCA
GTAAAGAGCG ACCAGGCAAA CTTTGGGCTT GACAAACTCT TAGGGTATTC TTTTGTGATT
TTGTGCAATG TGTCAAGAGA AAGCTTTAGC GATGAGTTTT TAAGCTCTGT AGAGAAGTAT
GTGAAGGATT TAGGTGGTGG ACTTTTGGTA ATTGGCGGCA CAAATTCATA TGCGCTTGGG
AATTATTCTA ATTCGGTTTT AGAAAAGATG CTGCCTGTCA AGATGGAGAT TAAGAACAAA
GAAAAGGAAA AGAACATAGA TGTTGTACTT GTCCTTGACC ATTCAGGCAG CATGGCAGAT
ACAGAAGACG CAGGTATTCC AAAATTAGAG ATTGCCAAGA GTGCTTCTGC AAAGATGATT
GAGCACCTTG AAAGTTCAGA TGGTGTTGGT GTGATTGCTT TTGACCACAA TTATTATTGG
GCATACAAAT TTGGTAAGAT AAGCAAAAAA GAAGATGTGA TAGAAAGCAT ATCAAGCATC
GAAGTAGGTG GTGGGACGGC TATAATTCCA CCCTTGAGTG AAGCAGTTAA AACTCTAAAA
AAGTCAAAGG CAAAAAGCAA GTTGATTGTG CTTCTGACTG ATGGCATGGG TGAACAAGGC
GGTTATGAAA TTCCAGCCAA TGAGGCAAAA AGAAATAACA TCAAAATCAC CACAATTGGT
GTTGGAAAGT ATGTAAACGC TACAGTTTTG AGTTGGATAG CCTCTTTTAC CTCAGGCAGG
TTCTATTTAG TTTCCAATCC TTCTGAGCTT GTTGATGTGT TTTTAAAAGA GACAAAAATT
ATAAAAGGCA AGTACATAAA GGAAAAGAAG TTTGTCCCCA AAGTGGTTGA GACAAATTCA
ATAAACTCAG GTTTTTCCTC TTATCCACCG CTTTACGGTT ATATCGCAAC AACAAAAAAA
GACCTTGCAA CCAGTCTTTT GGTAAGCGAC GAGGACGAGC CTATTTTGAC AGTGTGGAGG
TACGGGCTTG GAAAGGTTGC TGCATGGTGT TCGGACTTGA GCGGCCAGTG GTCGCGTGAC
TGGGTTTTGT GGGATAGATT TTCACAGTTT TGGACAAAGC TTTTCAAGTG GTTGGAAAAA
GGGGCAGATG ACAGCAGCTA TGATTTTAAT GTTCGAAAAG AAAAAGATTT GGTGGTATCT
TTGGTAGGCA AGTTCGATGC TGATACCATT GCAACTCTTA AATGCATGTA TCCAAACGGC
AAAGAAAAGA CAATTGCAAT GAGAAGAACT GCACCAGATA GGTTCGAATC TAAAGTTGAT
TTTATGCTGG GAAATTATGT ATTTGTGGCA ACCTTAAGTA ATAAGGACAA ATCCAAGGTA
TCAACGTTTT TCTATTCAGC TAACTACTCT GATGAGTTTA GAGTGGATAT AGACAGCAGC
AGGTTTGAAC AGTTTATCTC CTTATCAGGT GCAAAAGTTA TAAAAAATCC CAATGAGGTG
TATGCGGGAA AGCTCAAGAG CACAGAAGAA AAGAGGGATA TTAGCAGTTT ATTAATAGTT
CTTTGTATTG TATTATTCCT TTTGGAAGTG AGTATTAGAA GATTTGGTCT TTATCCTCAG
GTGGAAAGGA GTTTTTTGGC AATATCTTCA GGCTTTAAAA GGATTGCAAA GCCACATTCA
CTTAAAAGGG TATGGGATAA AGTATCTTCT TTGAAGAAGA AGAAGAAGAA TACTTATGCG
AAAAATTCAA AGGAAGCAGA GGTAGTATCT AATGATACTT TGGATGTTAA AAAACTGAGA
AGATTTTAA
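
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is supplied as text in the space-separated 10-base blocks shown above; the helper name `gc_content` is hypothetical and not part of any database tool.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Only the first line of the gene sequence is used here, so the printed
# value is for that fragment and not for the whole gene.
first_line = "ATGAGAATAG AGTTTGAAAG ACCTTTTATT TTACTTGCTG CAGCTGTGCT TGGAGTATTT"
print(round(gc_content(first_line)))
```

Passing the full 2709 bp sequence instead of `first_line` would give the figure to compare against the GC content field.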
 
Protein sequence
MRIEFERPFI LLAAAVLGVF IWLVSRRFSK ESLGKRFVVW VRIVLITLII LALSVPSLAI 
STDKITTIYL ADMSESNRKN AEKMKDFIQR SIKLKKSNEL QSVVVFGQDA NIEFTPTKYP
NFSEFGTAVD STQTNIENAI KYAVNLFDKD QQKRLVILTD GKETIGEAKN EVELLKNNGI
DVKVVLFKSE KEKDVQFSSL KIPQKVFKNQ AFSIFANINT TFSTKALLKI FRDNDLIFSQ
QVTLEKGENR FVIKDKLDKE GVFSYQGEVE AIDDEEESNN VAYAMCQVRK PQKVLVVYED
IDDVKNVKNL LDSYSADYDA VKSDQANFGL DKLLGYSFVI LCNVSRESFS DEFLSSVEKY
VKDLGGGLLV IGGTNSYALG NYSNSVLEKM LPVKMEIKNK EKEKNIDVVL VLDHSGSMAD
TEDAGIPKLE IAKSASAKMI EHLESSDGVG VIAFDHNYYW AYKFGKISKK EDVIESISSI
EVGGGTAIIP PLSEAVKTLK KSKAKSKLIV LLTDGMGEQG GYEIPANEAK RNNIKITTIG
VGKYVNATVL SWIASFTSGR FYLVSNPSEL VDVFLKETKI IKGKYIKEKK FVPKVVETNS
INSGFSSYPP LYGYIATTKK DLATSLLVSD EDEPILTVWR YGLGKVAAWC SDLSGQWSRD
WVLWDRFSQF WTKLFKWLEK GADDSSYDFN VRKEKDLVVS LVGKFDADTI ATLKCMYPNG
KEKTIAMRRT APDRFESKVD FMLGNYVFVA TLSNKDKSKV STFFYSANYS DEFRVDIDSS
RFEQFISLSG AKVIKNPNEV YAGKLKSTEE KRDISSLLIV LCIVLFLLEV SIRRFGLYPQ
VERSFLAISS GFKRIAKPHS LKRVWDKVSS LKKKKKNTYA KNSKEAEVVS NDTLDVKKLR
RF
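
The protein sequence is the translation of the gene sequence under translation table 11. As a rough illustration, the sketch below translates the first line of the gene sequence. The names `CODON_TABLE` and `translate` are hypothetical; the codon assignments are those of the standard genetic code, which table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model).

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGAGAATAG AGTTTGAAAG ACCTTTTATT TTACTTGCTG CAGCTGTGCT TGGAGTATTT"
print(translate(first_line))  # MRIEFERPFILLAAAVLGVF
```

The output matches the first 20 residues of the protein sequence listed above.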