Gene SeAg_B2451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B2451 
Symbol 
ID6796754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp2373180 
End bp2374976 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content53% 
IMG OID642776647 
Productvon Willebrand factor type A domain protein 
Protein accessionYP_002147271 
Protein GI197250621 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTAATG GAAAAACATT AATGCTGTTG TTGGGTGGGG TTGTTCTCTC GGGCTGTGGG 
CCGGAGCCCT CCGACCCGCA GGGAAATAAT CCGGCTGAGT TAAAACAGGA GCAGGCTATC
CAGAAAGAAA ATAGCGCTCA GGCCGGGGAT GACACTGTCC AGAAAAGACA AGCCGAGGCC
GCCCAACAGG CTGCGAAAAA AGCAGCCGAA TATAAAGCGA ATGCTGAAGC CAAAGCGGCA
TCTCTGGCAG ATCCCAAAGC GGGATCTCTG GCAACAGCCG AAGCACCGCA ACATGAAATG
CGGACACGTG CCGTGGCATC AAAAGCGTTC GCTGCACAAG GCGGTAATGT AATGGGGACC
GCGCGTTACG AACACTACGA TGAAAATCCG ATTAAACAGG TAAGTCAGGC GCCGCTCGCC
ACGTTTAGTC TGGATGTAGA CACTGGCAGC TACGCTAACG TGCGGCGCTT TCTGAATCAG
GGACAACTGC CGCCGCCGGA AGCCGTGCGG GTAGAAGAGA TGCTCAACTA TTTTCCCGCG
CCTCAGCCCG TTGCGGATAA GCAGGATAAC ACTAAACCCA TTGCGGCCTG TATACCGATG
CCATTTGCGG TTAAATACGA ACTGGCTCCC TCGCCGTGGA ACGCGCAGCG TACGCTATTA
AAAGTTGATG TTCAGGCCCG GGACATGCAG ACCAGAGATC TACCACCTGC CAACCTGGTT
TTTCTCATTG ATACTTCCGG CTCTATGCAG CCAGCGGAAC GCCTGCCGTT GATCCGGTCG
GCGCTAAAAC TGTTGGTGAA CGATCTGCGT GCGCAGGATA ACATCACGAT TGTGACCTAC
GCGGGCGGCA CTCACGTCGC GCTGGCGTCT ACAGCGGGAA ATAACACAAC CGCGATTAAA
GCGGCAATTG ATAATCTGGA TGCTTACGGG AGTACCGGTG GTGAAGCGGG ATTACGACTA
GCCTACGAGC AGGCCGAAAA AGGGTTTATC AAAGGCGGCG TTAACCGCAT TTTGTTGACC
ACCGACGGTG ATTTTAACCT CGGTATTACC GATCCCAAAG ACATCGAAGC GCTGGTAAAA
AAAGAGCGCG AGAAAGGTAT TACCTTATCT ACGCTGGGCG TCGGCGATGA CAATTTCAAC
GAAGCCATGA TGGTGAGAAT TGCTGATGTG GGTAACGGCA ATTACAGCTA CATCGACTCC
CTCTCCGAGG CGCAAAAAGT CCTCAAAGAT GAGATGCATC AAACGCTGGT CACCGTTGCC
AAAGATGTAA AATCGCAAAT CGAATTTAAT CCGCAGTGGG TGACTGAGTA CCGGCAGATT
GGTTATGAAA AACGCCAACT GCGCGACGAG GATTTCAATA ACGATAAGAT TGATGCCGGT
GATATCGGCG CGGGTAAACA CGTCACGCTA TTCTTTGAAC TGACGCTTAA CGGGCAGAAA
GCTTCGGTGG ATAAACTGCG CTACGCTCAG GACAAAGCTG CCTCAAAGAC AACAAAATCA
AGCGAGCTGG CGTGGATCAA ATTGCGCTGG AAAGCGCCGC AGGGCAGCGA AAGTACATTA
GCCGAGTTCC CGGTCGTTAT GGGAAAGATG CCGATCTTTG CTGACGCCTC TGAAGATTTT
CGTTTCCGCG CGGCGGTAGC GGCTTTCGGG CAAAAACTGC GTGGCTCAGA AACGCTGGCA
GATACGACCT GGCCGCAAAT TATTAAATGG GGTGAACAGG CGCGCGGGGA AGATAGACAA
GGCTATCGCG CGGAGTTTAT TAAACTGGTA AAACTGGCGG AAGGCTTGTC TCACTAA
 
Protein sequence
MLNGKTLMLL LGGVVLSGCG PEPSDPQGNN PAELKQEQAI QKENSAQAGD DTVQKRQAEA 
AQQAAKKAAE YKANAEAKAA SLADPKAGSL ATAEAPQHEM RTRAVASKAF AAQGGNVMGT
ARYEHYDENP IKQVSQAPLA TFSLDVDTGS YANVRRFLNQ GQLPPPEAVR VEEMLNYFPA
PQPVADKQDN TKPIAACIPM PFAVKYELAP SPWNAQRTLL KVDVQARDMQ TRDLPPANLV
FLIDTSGSMQ PAERLPLIRS ALKLLVNDLR AQDNITIVTY AGGTHVALAS TAGNNTTAIK
AAIDNLDAYG STGGEAGLRL AYEQAEKGFI KGGVNRILLT TDGDFNLGIT DPKDIEALVK
KEREKGITLS TLGVGDDNFN EAMMVRIADV GNGNYSYIDS LSEAQKVLKD EMHQTLVTVA
KDVKSQIEFN PQWVTEYRQI GYEKRQLRDE DFNNDKIDAG DIGAGKHVTL FFELTLNGQK
ASVDKLRYAQ DKAASKTTKS SELAWIKLRW KAPQGSESTL AEFPVVMGKM PIFADASEDF
RFRAAVAAFG QKLRGSETLA DTTWPQIIKW GEQARGEDRQ GYRAEFIKLV KLAEGLSH