Gene Shewmr4_1403 details

Gene Information

Locus tag: Shewmr4_1403
Symbol:
ID: 4251422
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 1634481
End bp: 1635497
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 51%
IMG OID: 638118002
Product: von Willebrand factor, type A
Protein accession: YP_733538
Protein GI: 113969745
COG category: [R] General function prediction only
COG ID: [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
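
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive start and end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


length = gene_length_bp(1634481, 1635497)
print(length, protein_length_aa(length))  # prints "1017 338"
```

Both values match the Gene Length and Protein Length fields of the record.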


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.520734
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000232326
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTAACAG TCGCATGGCC TTTGGCATTG ATTCTACTGC CTTTACCTTT TATTTTTTGG 
CGTCGTCAAA CGGTGCAAGC CGAAGGTGGT CGCCTGCAAT TGCCCGGCAT TAGTCAAACG
GGCAAGGCCA ATATCAGTAG CCACAGCCGA CAAAGCCGCA AGCGTTATTG GTTGATGTGG
AGCTTGTTAG TGCTTGCTAT CGCCCGCCCA CAATGGCTTG GCGATCCAAT TGAACTGCCA
AGCCAAGGGC GGGATCTGAT GCTGGCGGTG GACTTATCCG GCAGTATGCA AATTGAAGAT
ATGGTGATTA ACGGTAAAGT CGTCGACCGT TTTACCTTGA TCCAACACGT GGTCAGTGAG
TTTATAGAGC GTCGCAAAGG CGATCGTATC GGTTTGATTC TATTTGCCGA TCACGCTTAT
CTACAAGCGC CACTCACCCA AGATAGACGC TCTGTCGCTC AGTTTTTAAA GGAGGCGCAG
ATTGGCCTTG TCGGTAAGCA AACGGCGATT GGCGAGTCGA TTGCGCTCGC GGTCAAGCGC
TTCGATAAAA TGGATGAGAG TAACAGAGTC TTGATTTTAT TGACTGACGG TTCAAATAAC
GCCGGTAATA TCGATCCTGA TCAAGCGGCC CAAATCGCCG CTAATCGCAA AGTGACGATT
TATACCGTGG GCGTGGGTGC CGATGTCATG GAAAGACGCA CCCTGTTTGG TCGTGAGCGC
GTCAATCCGT CGATGGATCT CGATGAAAAC CAACTCAAGC ATATTGCCGA AGTGACCCAT
GGCCGCTATT TCCGCGCCCG TAACAGCCAA GAGCTGGAAC AGATTTATCA AGAAATTGAC
AAACTGGAAC CCGTGAGTCG CGATCAACTC AGCTATCGTC CTCAAGCGGA GCTATTCTAT
TGGCCGCTCG CACTCGCGCT GCTCACCAGC ATTTGGATAG CCTTAGGCCA ATTAGCGCTG
TTTTCGCGGA CTAAAAACCC TGTTCCATCA GCAACACCAC GGGGGGATGT TCGTTAA
 
Protein sequence
MLTVAWPLAL ILLPLPFIFW RRQTVQAEGG RLQLPGISQT GKANISSHSR QSRKRYWLMW 
SLLVLAIARP QWLGDPIELP SQGRDLMLAV DLSGSMQIED MVINGKVVDR FTLIQHVVSE
FIERRKGDRI GLILFADHAY LQAPLTQDRR SVAQFLKEAQ IGLVGKQTAI GESIALAVKR
FDKMDESNRV LILLTDGSNN AGNIDPDQAA QIAANRKVTI YTVGVGADVM ERRTLFGRER
VNPSMDLDEN QLKHIAEVTH GRYFRARNSQ ELEQIYQEID KLEPVSRDQL SYRPQAELFY
WPLALALLTS IWIALGQLAL FSRTKNPVPS ATPRGDVR
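
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how the gene sequence could be cleaned and its GC fraction recomputed for comparison with the GC content field of the record. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two ten-base blocks of the gene sequence, used here only as sample input.
sample = clean_sequence("ATGTTAACAG TCGCATGGCC")
print(len(sample))          # prints 20
print(gc_fraction(sample))  # GC fraction of the sample only, not of the whole gene
```

Passing the full gene sequence to `clean_sequence` and then to `gc_fraction` should give a value that can be compared with the reported GC content.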