Gene Acel_1893 details


Gene Information

Locus tag: Acel_1893
Symbol:
ID: 4486150
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 2140552
End bp: 2142540
Gene Length: 1989 bp
Protein Length: 662 aa
Translation table: 11
GC content: 67%
IMG OID: 639730683
Product: von Willebrand factor, type A
Protein accession: YP_873651
Protein GI: 117929100
COG category: [R] General function prediction only
COG ID: [COG4867] Uncharacterized protein with a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
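
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS is complete with a single terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2140552
END_BP = 2142540
GENE_LENGTH_BP = 1989
PROTEIN_LENGTH_AA = 662


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon, assuming a complete CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


coords_consistent = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_consistent = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print(coords_consistent, protein_consistent)
```

Under those assumptions both checks pass for this record: 2142540 - 2140552 + 1 = 1989 bp, and 1989 / 3 - 1 = 662 aa.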


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.376882
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCAT CAGCCTACCG GTACGGACCG TTTCATGACG GGCCGGATCC GTTGGCGCCG 
CCGTACGACG TCGCACGGGC GCTGGACGAG CTCGGGGACG ACGTTCTTTC TGGCGCGAGT
CCCGCAGACG CTCTGAGAAA GCTGCTGCGC CACGGTGCAC CGGGACTGCG TGGCACCGAC
GACCTCCTGC GCCAGGTCCG GGAACGCCGG CGTGCCCTTC GGGAGAGCGG CCGGTTCGGC
GGGACACTCG AGCAGGCCCG TGCTTTGCTG GACAAGGCCA TCGGCCAGGA ACGCGCCGCC
CTTTTCCCCG ATCCCAGCGA TGATGCGCGG TTGCGGGAGG CCGAACTCGA CGCCCTCCCT
GCGGACACCG CACGAGCCAT CCGAGCGCTC GCCGACTACG ATTGGCGAAG CCCGCAGGCT
CGGCGAACCT ATGAGGAATT GAAGAATCTG CTGCGTGATG AGGTGCTCGA CACCCAGTTT
CGGGGGATGC GGGAGGCGCT CCGCCAGATG CGGGACGCCT CGTCCAGCGC GACCGCTGCC
GCCGTCAAGG ACATGCTCGC CGACCTCAAT GACATGCTCG CGGCCGACGA GCGCGGCGAG
CACACGCAGG AGAAATTCGA CGACTTCATG GCCCGCCACG GCCACTTTTT TCCCGATAAT
CCCAGGAATC TCGACGAATT GGTCGACTCG CTGGCGCGGC GGGCGGCCGC CATGGAACGC
ATGCTCGCGT CCATGAGCCG GGAACAGCGG GAAGAACTCG CGGCGTTGAT GGCTCAGGTC
ATGGCCGACC TCGGCCTGGC GGCTGAACTG GCCCGTCTCA ATGACGCGCT GCGCCGCCGA
CGCCCGGATC TCGACTGGTC CGGCCGGACC CGGCTCCGCG GCGACGAACC ACTGTCGGCC
CCGGATGCGA CGTCGGTTCT CGAGGAGCTC GCGGATCTCG AAGAAGTCGC CGCCACGCTT
GCGCAGGATT ATCCCGGCGC CCGCCTCGAC GACATTGATG AGGAAGCGGT CCGGCGCGCA
CTCGGCCGCA GTGCAGTAGA CGATCTGCGC CGGTTGCGGG ACATCGAACG CGAATTGGAA
CGGCAGGGGT ACATCCGCCG CGAGGCCGGC CGGCTGGAGT TGACGCCGAA AGCGGTCCGC
CGCCTCGGCG CGACCGCACT CCGGCGGATT TTCGCCTCGC TGGAAGGAGC GCGATCCGGC
GGCCACGATA CCCCCGATGC CGGGACCGCC GGTGAATTGA CGGGCTCGTC GCGACCATGG
GAATTCGGCG ACGAGCAGCC CCTCGACGTC GTCCGCAGCC TGCGCAACGC GATCCGGAAC
GGCCGTGTCC GGCGGGAACC CGACGGCCGC CCGGCACTGC GCCTCGCCGT CGAGGATTTC
GAGGTCTTCG AAACCGAACG GCGGACCGCC GCCGCCGTCT GCCTGCTCGT CGACCTCTCC
TGGTCGATGA CCCTGCGCGG CACGTGGGGC GCCGCCAAGG CAACCGCACT GGCGTTGCAC
TCCCTGGTCA CGACGCAATT CCCGCAGGAC GCCCTGCAAA TCATCGGTTT TTCGAATTAC
GGCCGAGTAC TTCAGCCCAC CGAGCTCGCC GGCCTGGACG CCGAAATGGT GCAGGGCACC
AATTTGCAGC ACGCCCTCCT CATCGCCGGC CGCTTTCTCG ACCGCCATCC CGAATACGAA
CCCATCGTCA TGATTGTCAC GGACGGCGAA CCGACCGCTC ACCTCCTGCC GGATGGCGAC
TACGCCTTCG ACTGGCCACC GTCCCGGCAG ACGATCACAC TCACACTGGC CGAAGTCGAC
AAGATGACCC GGCGCGGCGC CGCCTTGAAT GTCTTCATGC TGGCGGACGA TCCGGGGTTG
GTGGACTTCG TCGAACTCAT GGCAAAACGC AACGGTGGCA GGGTCTTTTC ACCGTCCAAG
GAGAGACTCG GCAGCTACGT GGTCAGCGAC TATTTACGAT CGAGGCGTGG ACGACGTCGG
GCGGGCTGA
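
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs cleaning before it can be compared with the GC content field. The following is a minimal sketch, assuming GC content means the fraction of G and C bases over the full sequence length. The helper names are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from this record; paste the whole
# sequence here to compare against the reported GC content.
first_line = "ATGAGCGCAT CAGCCTACCG GTACGGACCG TTTCATGACG GGCCGGATCC GTTGGCGCCG"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq)))
```

Running the same two functions on the complete 1989 bp sequence gives a value that can be compared with the 67% reported in the Gene Information section.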
 
Protein sequence
MSASAYRYGP FHDGPDPLAP PYDVARALDE LGDDVLSGAS PADALRKLLR HGAPGLRGTD 
DLLRQVRERR RALRESGRFG GTLEQARALL DKAIGQERAA LFPDPSDDAR LREAELDALP
ADTARAIRAL ADYDWRSPQA RRTYEELKNL LRDEVLDTQF RGMREALRQM RDASSSATAA
AVKDMLADLN DMLAADERGE HTQEKFDDFM ARHGHFFPDN PRNLDELVDS LARRAAAMER
MLASMSREQR EELAALMAQV MADLGLAAEL ARLNDALRRR RPDLDWSGRT RLRGDEPLSA
PDATSVLEEL ADLEEVAATL AQDYPGARLD DIDEEAVRRA LGRSAVDDLR RLRDIERELE
RQGYIRREAG RLELTPKAVR RLGATALRRI FASLEGARSG GHDTPDAGTA GELTGSSRPW
EFGDEQPLDV VRSLRNAIRN GRVRREPDGR PALRLAVEDF EVFETERRTA AAVCLLVDLS
WSMTLRGTWG AAKATALALH SLVTTQFPQD ALQIIGFSNY GRVLQPTELA GLDAEMVQGT
NLQHALLIAG RFLDRHPEYE PIVMIVTDGE PTAHLLPDGD YAFDWPPSRQ TITLTLAEVD
KMTRRGAALN VFMLADDPGL VDFVELMAKR NGGRVFSPSK ERLGSYVVSD YLRSRRGRRR
AG
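
The protein sequence above corresponds to the gene sequence translated with translation table 11, as listed in the Gene Information section. The sketch below shows one way to reproduce that translation. It builds the codon table from the usual TCAG ordering, whose amino acid assignments for table 11 match the standard code. It does not handle the alternative start codons that table 11 allows, which is not a problem here because this gene starts with ATG. The function name is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code / table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
print(translate("ATGAGCGCAT CAGCCTACCG GTACGGACCG"))  # MSASAYRYGP
```

The ten residues printed match the first block of the protein sequence in this record. Passing the full gene sequence should reproduce the whole 662 aa protein if the record is internally consistent.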