Gene SO_3552 details

Gene Information

Locus tag: SO_3552
Symbol:
ID: 1171223
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella oneidensis MR-1
Kingdom: Bacteria
Replicon accession: NC_004347
Strand:
Start bp: 3708345
End bp: 3710210
Gene Length: 1866 bp
Protein Length: 621 aa
Translation table: 11
GC content: 44%
IMG OID: 637345351
Product: von Willebrand factor type A domain-containing protein
Protein accession: NP_719099
Protein GI: 24375056
COG category: [R] General function prediction only
COG ID: [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
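
The start, end, and length fields of this record can be cross-checked with a short calculation. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(3708345, 3710210)
print(length)                  # 1866
print(protein_length(length))  # 621
```

Both results agree with the Gene Length and Protein Length fields of the record.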


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACACATC CATTACGTTT ATTGAACACC CAACTCGTTT CCATCTCCTC ACATAAAAAA 
CGTCACAATG CAGCTGTTTC TGCCTTAACA CTCGCCATTC TATTGGGGTT GAGTGGTTGT
ATTGATAAAC AAACTGAATC GGAGAGTCGC ACTGAATTAG CGACTCAAGC TAACGAGGCT
GCAGAGCAGC AAGCTGCGCT TGTGCAACGT GTTGAGGCTG AACGGCAGGC TAAAATACAA
CATGAAAATG AATTAAGTGC AAAATCGCAG GATATGCGTG CTGAACATAT GCCTTATATT
GCTCAATATG CCGCTAGTTC TAGTGTAGCC GCCCCCGGGT TAAACGATGA TTGGCAAGGG
GCAGTGCTTC CTGAGCGTAA TCAATTTGAA AAGCAAGTAC AAAACGGCAT TATGGTCGCG
GGAGAAATCC CTGTATCGAC CTTTTCAATC GATGTTGATA CCGGCAGTTA TACCACATTA
AGGCGAATGC TAAAGGAAGG GCGGTTACCG CAGAAAGACA CGTTACGCGT TGAAGAAATG
CTGAATTATT TTTCTTATAA CTACCCACAA CCCAATAAAA ATGAGGCGCC ATTTAGTGTA
ACAACAGAGC TTGCACCATC ACCTTATAAT GATGACATGA TGTTATTGCG TATTGGCTTA
AAGGGATATG AGCAGAGTAA AGCTGAGCTT GGGGCGAGTA ATTTAGTCTT CTTGCTGGAT
GTCTCTGGCT CTATGGCGTC TGATGATAAG TTACCTCTGC TGCAAACCGC CTTAAAAATG
CTGACTCAAC AACTGGATGA ACAGGATAAA GTATCGATTG TGGTCTACGC GGGGGCTGCA
GGCGTGGTGC TCGATGGCGC TGCAGGTAAC GATATTAAAA TCCTTACCTA TGCATTGGAA
CAGTTAACTG CAGGAGGTTC AACCAATGGC GCAGAAGGGA TCCAACTGGC TTATCAGTTA
GCGCAGAAAC ATTTTGTTAA AGGTGGCATT AACCGAGTCA TTCTCGCAAC CGACGGTGAT
TTTAATGTTG GGACCACCAA CCTCGATGAG TTAGTCGACT TGGTTGAAGT ACAGAAAAAA
CATGGAATTG GCTTGACGAC ACTTGGCTTT GGCATGGGTA ACTACAATGA CCACTTAATG
GAGCAACTTG CCAATAAAGG TAATGGACAA TATGCCTACA TTGATTCTGT GAACGAGGCT
CGAAAAGTGC TGGTAGAACA GTTAGGTGCG ACTTTGCTGA CCATCGTCAA AGAGGTAAAA
GTGCAAGTTG AGTTTAATCC CGCTTTAGTT TCAGAGTACC GTCTTATTGG TTATGAAAAC
CGCGCCTTAG CCCGCGAGGA TTTTAATAAC GATAAGGTGG ACGCTGGGGA AATTGGTGCG
GGTCATACCG TTACCGCGCT TTATGAACTT CGCTATGTTG AAACCGGTCA TATGGCGAAT
GACAAACTCC GTTACGGTTA TAACCCTGAT ACAGGCAGTG AAAAATATAG CCGTGACGAA
ATTGCCTATC TTAAATTACG TTATCAACTG CCCGATGCGA GTAAAAGCCA ATTATTGACG
TATCCGATAA GAGCCGATCA GAGCGTTAAA ACGGTAAATC AAGCCAGTGA TGATTTTAGA
TTTGCGGCGG CAGTTGCGGG ATTAGGGCAG TTACTGAATC AAAGCCACTA CTTGCATCAA
TTTGATTATA ATAAACTTAG GGGGCTTACT CGTTCTGCAC TGGGGGAAGA TACAATGGGT
TATCGACATG AGTTTATGCA ACTTGTTGAT ACTGCAGCGC TGCTAGCGCA AACGAATCAA
GTCCCCATTA AGAAATCCTT TGATGCAGAG AATAAACCTT TTCCACCTCA GGATAAACTT
CATTAG
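
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any computation such as GC content. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join block-formatted sequence text into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as displayed in the record.
first_line = "ATGACACATC CATTACGTTT ATTGAACACC CAACTCGTTT CCATCTCCTC ACATAAAAAA"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.1%}")  # GC fraction of this 60 bp window only
```

Passing the full sequence text through the same two functions gives a value that can be compared with the GC content field of the record.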
 
Protein sequence
MTHPLRLLNT QLVSISSHKK RHNAAVSALT LAILLGLSGC IDKQTESESR TELATQANEA 
AEQQAALVQR VEAERQAKIQ HENELSAKSQ DMRAEHMPYI AQYAASSSVA APGLNDDWQG
AVLPERNQFE KQVQNGIMVA GEIPVSTFSI DVDTGSYTTL RRMLKEGRLP QKDTLRVEEM
LNYFSYNYPQ PNKNEAPFSV TTELAPSPYN DDMMLLRIGL KGYEQSKAEL GASNLVFLLD
VSGSMASDDK LPLLQTALKM LTQQLDEQDK VSIVVYAGAA GVVLDGAAGN DIKILTYALE
QLTAGGSTNG AEGIQLAYQL AQKHFVKGGI NRVILATDGD FNVGTTNLDE LVDLVEVQKK
HGIGLTTLGF GMGNYNDHLM EQLANKGNGQ YAYIDSVNEA RKVLVEQLGA TLLTIVKEVK
VQVEFNPALV SEYRLIGYEN RALAREDFNN DKVDAGEIGA GHTVTALYEL RYVETGHMAN
DKLRYGYNPD TGSEKYSRDE IAYLKLRYQL PDASKSQLLT YPIRADQSVK TVNQASDDFR
FAAAVAGLGQ LLNQSHYLHQ FDYNKLRGLT RSALGEDTMG YRHEFMQLVD TAALLAQTNQ
VPIKKSFDAE NKPFPPQDKL H
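
The protein sequence corresponds to the gene sequence read in codons. A minimal sketch of that translation follows, assuming the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, third base varying fastest,
# paired with the standard amino-acid string (also used by table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGACACATC CATTACGTTT ATTGAACACC"))  # MTHPLRLLNT
# Last 15 bases of the gene sequence, ending in the stop codon TAG.
print(translate("GATAAACTT CATTAG"))                  # DKLH
```

The two outputs match the first ten and last four residues of the protein sequence shown above.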