Gene Shewana3_3153 details


Gene Information

Locus tag: Shewana3_3153
Symbol:
ID: 4477747
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. ANA-3
Kingdom: Bacteria
Replicon accession: NC_008577
Strand:
Start bp: 3780541
End bp: 3782382
Gene Length: 1842 bp
Protein Length: 613 aa
Translation table: 11
GC content: 50%
IMG OID: 639727757
Product: von Willebrand factor, type A
Protein accession: YP_870783
Protein GI: 117921591
COG category: [R] General function prediction only
COG ID: [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
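
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic, which can be used as a consistency check. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    length = gene_length(3780541, 3782382)
    print(length, protein_length(length))
```

With the coordinates listed in this record, the two functions give 1842 bp and 613 aa, matching the reported values.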


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.573485
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.703806
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGATATC CATCTTCAGT CTTTCGCCAT AAAGCGCATT TAAGTTTTGT TGTTTCGGGA 
TTAACGCTCG CTATTTTGCT CGGTTTGAGT GCTTGCAGTG ATAAAGCCGC CGAGCAGCAA
ACCCCTGCTG AATTAGCCGC TCAAGCAAAA CTCGCCGCCG AGCAACAGGC CGAGCGTCAG
GCCAATAGGC AAAGAGATGC CGCAATCGCC ATGCATGAAC AAGCCTCGTC AGCAAAACTG
CGGACAATGA GTGCTGAGAG TCGAGCCTAT ATTGCGCAAC CTACTGCCAG TATCAGTGCT
GCGCCCGCGT TAAACGGCGA TTGGCCGGGG GCTGTGCCAC CCGAGCGCAA TCGCTTCGAG
AAGCAAGTGC AAAACGGCAT CATGGTTGCG GGGGAAATCC CGGTCTCCAC CTTTGCTATC
GATGTCGATA CTGGTAGTTA CACGACCTTA AGGCGAATGT TAAAGGAAGG GCGGTTACCA
CAGAAGGACA CGCTGCGGGT TGAGGAAATG CTGAATTATT TTTCCTATGA CTATCCACTG
CCGGGGAAAA ATGACGCGCC CTTTAGTGTT ACGACCGAGC TTGCACCATC GCCCTATAAC
GATGACATGA TGTTACTTCG CATCGGTTTG AAGGGATATG AGCAGAGTAA GGCTGAACTG
GGCGCCAGTA ACTTAGTGTT TCTGCTGGAT GTGTCAGGGT CGATGGCATC GCCCGATAAG
TTACCTTTGC TGCAAACTGC CTTGAAAATG CTGACCCAGC AATTGGATGC TCAGGATAAG
GTATCGATTG TCGTCTACGC CGGCGCCGCT GGTGTAGTGT TAGATGGTGC AGCTGGTAAC
GATACTCAAA CCCTTAACTA TGCGTTAGAG CAGCTTAGTG CCGGCGGTTC AACCAATGGG
GCGCAGGGGA TTCAGCTTGC CTATCAGTTA GCGCAGAAAC ACTTTGTTGA AGGCGGCATC
AATCGAGTCA TTCTCGCGAC CGACGGTGAC TTTAATGTCG GCACGACCAA CCTCGATGAG
TTAATCGATT TGGTTAGCGC GCGGAAACAA CAGGGCATAG GGCTCACGAC ACTCGGCTTT
GGCATGGGCG ACTACAATGA CCATCTGATG GAGCAATTGG CCGATAAGGG CAATGGGCAA
TATGCCTATA TTGATTCTAT CAATGAGGCG AGAAAAGTGC TGGTGGAACA CTTAAGTGCA
ACCTTACTCA CCATAGCAAA AGAGGTGAAA GTGCAGGTCG AGTTTAATCC CGCTCTTGTG
GCCGAGTATC GCTTGATTGG CTATGAGAAC CGAGCGCTCG CGCGTGAAGA TTTTAATAAT
GACAAGGTGG ACGCGGGCGA AATTGGCGCA GGGCATACAG TCACGGCGCT TTACGAGCTG
CGTTATGTTG ATGCGGGGAA TTTGGCCAAT GATAAACTTC GCTATGGCTA TAATCCCAAA
ACGGGCAATG AAAAATATAG CCGCGACGAA ATCGCCTTTC TGAAATTACG TTATCAGCTA
CCGGATGCGA CTCAAAGCCA GCTACTGAGT TATCCGATTC GAGCAGACCA AAGGGTAAAA
TCATTAGCGC AGGCGAGTGA TGATTTTCGT TTTGCCGCTG CAGTGGCTGG TTTAGGACAG
TTGCTGAATC AAAGCCACTA TTTGCATCAA TTTGATTATA ATAAGCTTAG TGCGCTCACA
CGTTCTGCGC TGGGGGAAGA TACCAGCGGC TACCGACATG AATTTATGCA ACTTGTCGAT
ACCGCTGCGG CACTCGCACA AACACAGCGA GCACCAATCA AAAAATCCTT TGATGTCGGA
GATAAACCTT TCCCGCCCGA GGACAAACTG CATCAGCAAT GA
 
Protein sequence
MRYPSSVFRH KAHLSFVVSG LTLAILLGLS ACSDKAAEQQ TPAELAAQAK LAAEQQAERQ 
ANRQRDAAIA MHEQASSAKL RTMSAESRAY IAQPTASISA APALNGDWPG AVPPERNRFE
KQVQNGIMVA GEIPVSTFAI DVDTGSYTTL RRMLKEGRLP QKDTLRVEEM LNYFSYDYPL
PGKNDAPFSV TTELAPSPYN DDMMLLRIGL KGYEQSKAEL GASNLVFLLD VSGSMASPDK
LPLLQTALKM LTQQLDAQDK VSIVVYAGAA GVVLDGAAGN DTQTLNYALE QLSAGGSTNG
AQGIQLAYQL AQKHFVEGGI NRVILATDGD FNVGTTNLDE LIDLVSARKQ QGIGLTTLGF
GMGDYNDHLM EQLADKGNGQ YAYIDSINEA RKVLVEHLSA TLLTIAKEVK VQVEFNPALV
AEYRLIGYEN RALAREDFNN DKVDAGEIGA GHTVTALYEL RYVDAGNLAN DKLRYGYNPK
TGNEKYSRDE IAFLKLRYQL PDATQSQLLS YPIRADQRVK SLAQASDDFR FAAAVAGLGQ
LLNQSHYLHQ FDYNKLSALT RSALGEDTSG YRHEFMQLVD TAAALAQTQR APIKKSFDVG
DKPFPPEDKL HQQ
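
The gene and protein sequences above are related by translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch using only the Python standard library, not the method the database itself uses. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the sketch assumes the sequence starts in frame, contains only A, C, G and T, and uses the amino-acid assignments of table 11, which match the standard genetic code.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments of translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 20 bases of the gene sequence listed above.
    print(translate("ATGAGATATC CATCTTCAGT"))
```

Pasting the full gene sequence into `translate` gives a result that can be compared against the listed protein sequence, and `gc_fraction` gives a value to compare against the reported GC content. Alternative start codons permitted by table 11 are not handled here, which does not matter for this gene because it begins with ATG.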