Gene Dred_2244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDred_2244 
Symbol 
ID4955406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum reducens MI-1 
KingdomBacteria 
Replicon accessionNC_009253 
Strand
Start bp2453286 
End bp2454536 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content47% 
IMG OID640181417 
Productvon Willebrand factor, type A 
Protein accessionYP_001113581 
Protein GI134300085 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0785649 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACAAA TTCAAATTGA TCTTGCTCTA GATAAGACCT ATCTGCTACC AGGTAATAAG 
CAGGTGGCCT ATCTGATGGT AAAGCTTACT GCGCCTAAGC AAGTGGAGAA GGAAAGGCCG
GTGCAGAATC TGTCCTTTGT TATTGACCGC AGCGGCAGTA TGGCAGGGGA AAAGCTAGAC
TACACCAAAA AGGCAGTTGC CTTTGCGGTT GGTCATCTAA GTCCACAGGA TTACTGCTCG
GTAGTAGCCT TTGACGATAT GGTAACGATG GTGGCCTCCT CTCACCAGGT GGCAAACAAA
GATGCACTTA AGATGGCGGT GGAAAGTATC TATCCCGGTG GCAGCACAAA CCTAAGCGGC
GGCATGCTGC TGGGCGTAAG GGAAGTAAAG CTGGCCCACA AAGAGAATCA AATCAACCGG
GTGCTGCTGC TAACAGATGG CATGGCCAAT GTGGGAGTGA CAGACCACAG TGCCCTGGTG
GAGAAGTCCC GGGAAATGGC TGCCGGTGGG GTTAATCTTT CTACCTTTGG TTTGGGGGAA
GATTTTGAAG AAGATTTATT GCAGGCAATG GTGGAGGCTG GGGGCGGTAA CTTCTATTAT
ATAGAAAAAC CGGATCAAAT ACCTGGTATT TTTGAACAGG AATTAACTGG GTTGCTAAGT
ATCGTGGCCC AAAATCTCTC AGTAAAAGTG AAACCGGGGC AAGGTGTGTC TATAACCGGA
GTGCTTGGTT ATCCCTTTAG CTCCGAGGAA GGGGTTACTG TAAACCTGCC GGATATTTAT
AGTGGTGAAT CAAAGCTATT GCTACTGGAG TTGCTTATTT CGCCGCTGAC GGAAGGTAAT
CACAAGCTCA TCAGTGTAGA GTTGGATTAT GCAGACGTTC GAAAAAGCCT GGCGCTGGTG
AATCTCAAGG CAGAGCTAAG TATAAATGCT AGTGCGGAAA TAGGGGATGA ACCTGCTGAA
AACATAGAGG TGATCAAGCA GGTGGAACTA TTCCGCTGTG CTCAGGCTAA GGAAGAAGCT
ATTCGGTTAG CTGATCAGGG AGACTTCCAG GCTAGTCGTC TTGTCTTGGA AAATCAGTTA
TATAAGCTAC AGTCTTTGGG AGCTTGTTTA GATTCTAGTG ATCTTAATAT GGAAGTAAAC
GAATTACAGG AAAACCTTTG CTTTATGTCC GAGGGCAGTT ATGATAAGGC CTCACGGAAG
AAAATGTCCT TTAACGCTTA CCAACGGAAG AAAGGGAGAG GTAGAAAATA A
 
Protein sequence
MEQIQIDLAL DKTYLLPGNK QVAYLMVKLT APKQVEKERP VQNLSFVIDR SGSMAGEKLD 
YTKKAVAFAV GHLSPQDYCS VVAFDDMVTM VASSHQVANK DALKMAVESI YPGGSTNLSG
GMLLGVREVK LAHKENQINR VLLLTDGMAN VGVTDHSALV EKSREMAAGG VNLSTFGLGE
DFEEDLLQAM VEAGGGNFYY IEKPDQIPGI FEQELTGLLS IVAQNLSVKV KPGQGVSITG
VLGYPFSSEE GVTVNLPDIY SGESKLLLLE LLISPLTEGN HKLISVELDY ADVRKSLALV
NLKAELSINA SAEIGDEPAE NIEVIKQVEL FRCAQAKEEA IRLADQGDFQ ASRLVLENQL
YKLQSLGACL DSSDLNMEVN ELQENLCFMS EGSYDKASRK KMSFNAYQRK KGRGRK