Gene SeD_A2659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2659 
Symbol 
ID6871247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2542837 
End bp2544618 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content53% 
IMG OID642785726 
Productvon Willebrand factor, type A 
Protein accessionYP_002216383 
Protein GI198245970 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.101316 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones73 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTAACG GAAAAACATT AATGCTGTTG TTGGGTGGGG TTATTCTCTC GGGCTGTGGG 
CCGGAGCCCT CCGACCCGCA GGGAAATAAT CCGGCTGAGT TAAAACAGAA GCGGACTATC
CAGAAAGAAA ATAGCGCTCA GGCCGGGGAT GACACTGTCC AGAAAAGACA AGCCGAGGCC
GCCCAACAGG CTGCGAAAAA AGCAGCCGAA TATGCTGAAG CCAAAGCCCT GGCAGATGCC
AAAGCGGCAT CTCTGGCAAC AGCCGAAGCA CCGCAATATG TAATGCAGAC ACGTGCCGCG
GCACCAAAAG CGTTCGCTGC ACAAGGCGGT AATGTAATGG GGACCGCGCG TTACGAACAC
TACGATGAAA ATCCGATTAA ACAGGTAAGT CGGGCGCCGC TCGCCACGTT TAGTCTGGAT
GTAGACACTG GCAGCTACGC TAACGTGCGG CGCTTTCTGA ATCAGGGACA ACTGCCGCCG
CCGGAAGCCG TGCGGGTAGA AGAGATGCTC AATTATTTTC CCGCGCCTCA GCCCGTTGCG
GATAAGCAGG ATAACACTAA ACCCATTGCG GCCTGTATAC CGATGCCATT TGCGGTTAAA
TACGAACTGG CTCCCTCGCC GTGGAACGCG CAGCGTACGC TATTAAAAGT TGATGTTCAG
GCCCGGGACA TGCAGACCAG AGATCTGCCA CCTGCCAACC TGGTTTTTCT CATTGATACT
TCTGGTTCTA TGCAGCCAGC GGAACGCCTG CCGTTGATCC GGTCGGCGCT AAAACTGTTG
GTGAACGATC TGCGTGCGCA GGATAACATC ACTATTGTGA CCTACGCGGG CGGCACTCAC
GTCGCGCTGG CGTCTACAGC GGGAAATAAC ACAACCGCGA TTAAAGCGGC AATTGATAAT
CTGGATACTT ACGGGAGTAC CGGTGGTGAA GCGGGATTAC GGCTAGCCTA CGAGCAGGCC
GAAAAAGGGT TTATCAAAGG CGGCGTTAAC CGCATTTTGT TGACCACCGA CGGTGATTTT
AACCTCGGTA TTACCGATCC CAAAGACATC GAAGCGCTGG TAAAAAAAGA GCGTGAGAAA
GGTATTACCT TATCTACGCT GGGCGTCGGC GATGACAATT TCAACGAAGC CATGATGGTG
AGGATTGCTG ATGTGGGTAA CGGCAATTAC AGCTACATCG ACTCCCTCTC CGAGGCGCAA
AAAGTCCTCA AGGATGAGAT GCATCAAACG CTGGTCACCG TTGCCAAAGA TGTAAAATCG
CAAATCGAAT TTAATCCGCA GTGGGTGACT GAGTACCGGC AGATTGGTTA TGAAAAACGC
CAACTGCGCG ACGAGGATTT CAATAACGAT AAGGTTGATG CCGGTGATAT CGGCGCGGGT
AAACACGTCA CGCTATTCTT TGAACTGACG CTTAACGGGC AGAAAGCCTC GGTGGATAAA
CTGCGCTACG CTCAGGACAA AGCCGCCTCA AAGCCAACAA AATCAAGCGA GCTGGCGTGG
ATCAAATTGC GCTGGAAAGC GCCGCAGGGC AGCGAAAGTA CATTAGCCGA GCTCCCGGTC
GTTATGGGAA AGATGCCGAT CTTTGCTGAC GCCTCTGAAG ATTTTCGTTT CCGCGCGGCG
GTAGCGGCTT TCGGGCAAAA ACTGCGTGGC TCAGAAACGC TGGCAGATAC GACCTGGCCG
CAAATTATTA AATGGGGTGA ACAGGCGCGC GGGGAAGATA AACAAGGCTA TCGCGCGGAG
TTTATTAAAC TGGTAAAACT GGCGGAAGGC TTGTCTCACT AA
 
Protein sequence
MLNGKTLMLL LGGVILSGCG PEPSDPQGNN PAELKQKRTI QKENSAQAGD DTVQKRQAEA 
AQQAAKKAAE YAEAKALADA KAASLATAEA PQYVMQTRAA APKAFAAQGG NVMGTARYEH
YDENPIKQVS RAPLATFSLD VDTGSYANVR RFLNQGQLPP PEAVRVEEML NYFPAPQPVA
DKQDNTKPIA ACIPMPFAVK YELAPSPWNA QRTLLKVDVQ ARDMQTRDLP PANLVFLIDT
SGSMQPAERL PLIRSALKLL VNDLRAQDNI TIVTYAGGTH VALASTAGNN TTAIKAAIDN
LDTYGSTGGE AGLRLAYEQA EKGFIKGGVN RILLTTDGDF NLGITDPKDI EALVKKEREK
GITLSTLGVG DDNFNEAMMV RIADVGNGNY SYIDSLSEAQ KVLKDEMHQT LVTVAKDVKS
QIEFNPQWVT EYRQIGYEKR QLRDEDFNND KVDAGDIGAG KHVTLFFELT LNGQKASVDK
LRYAQDKAAS KPTKSSELAW IKLRWKAPQG SESTLAELPV VMGKMPIFAD ASEDFRFRAA
VAAFGQKLRG SETLADTTWP QIIKWGEQAR GEDKQGYRAE FIKLVKLAEG LSH