Gene Mmar10_1049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_1049 
Symbol 
ID4285330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp1143413 
End bp1145080 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content63% 
IMG OID638140520 
Productvon Willebrand factor, type A 
Protein accessionYP_756280 
Protein GI114569600 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.691019 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.362005 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGCG TTCTTTTGAT TGGTACGGCT GGCCTGGCCA TGCTCGCAGC CTGTACAGCG 
ACCGTTGCAC CGCCAACGGC CCCACCTCCG CCACCGGCAC CGCCACCCCC TCCGCCGGCG
CCCTTCATGT CGGAAATGGT CACCGTGACC GGGTCCCGTG TCCGCAGCTC GGCCTATCGC
AATGACGCGA TTGCCGGCGT TGCGAATTTC ACGATGCCCG ATCAGCTGGT CGTCGACCGC
GAACGCTATG AAGACGTCGA TCCCAATCCG GTGATGTCAA CGGCTGATGA GCCGGTCTCG
ACCTTCTCCA TCGATGTCGA CACGGCCAGC TATTCGCTGG TCCGCAACTC GCTGGAGGCC
GGCCGCTTGC CGCCGACCGA TGCCGTGCGG ATCGAGGAGA TGGTCAATTA TTTCGACTAT
GACTATGCCC TGCCGCCGGG ACCCGACGAG CCGTTCGCGA CCCATGTGAC GGTCACCCCG
ACGCCGTGGA ATGCCGACAC CCAGCTGATG CATATCGGCA TCCAGGGTTA TGAGATCATT
CCCGACGAAC GGCCGCGGGC CAATCTGGTC TTCCTGATCG ATGTATCCGG TTCGATGAAT
TCACCCGACA AGCTGCCGCT GGCTGTCCAG GCCATGCACT TGCTGGTTGA TGAATTGCAC
CCGGACGACA CCGTGGCCCT GGTTGTCTAC GCCAGTGCCA GCGGGGTCGT ATTGCCGCCG
ACCGAGGCCC GCAATGCCCG CGAGATCCAC CGCGCACTCG ACAGTCTGAG CGCAGGCGGT
TCAACCGCCG GCGGGGCCGG TCTGGCGCTG GCCTATGACC TCGCCGAACA GAATTTCGAC
GAGGACGCGG TCAATCGGGT CATGCTTCTC ACCGATGGCG ATTTCAATGT CGGCGTGACC
CAGGATGAGC GTCTGGAAGA CTTTGTCGCC CGCAAGCGCG ATAGCGGCAT CTACCTGTCG
GTGATGGGCT TTGGTCGCGG CAATTACAAT GACCAGATGA TGCAGACCAT CGCCCAGGCC
GGCAATGGCA CGGCGGCCTA TATCGACAGT CGCCAGGAAG CCCGGCGCAT GCTGGTCGAG
GAAAGCTTCT CCTCGCTCTT TACGATCGCC AATGACGTCA AGATCCAGGT CGAGTTCAAT
CCGGCCCGTG TCGCCGAATA CCGGCTGATC GGCTATGAGA CCCGGCTGCT TGACCGGGCC
GATTTCAACA ATGATGAAGT CGATGCCGGC GAGATCGGTT CGGGCCATTC GGTCACCGCG
ATCTATGAGA TTGCCGCGCC GGGATCGGAA GGCGTGCTGA TGGAGCCGCT GCGCTATGGC
GGAGCGACGA CGGTCGAGCC GGACCTGGAC GGCGAGTATG GTTTCCTGCG TCTGCGCTAC
AAGCGGCCGG GCGAGGATGA AAGCCGCCTG ATCGAGCGCG CGGTGACCGA TGCCGACACG
GTCGACGATG TCGGCGAGGC CTCGCAGGAG GCCCGCTTCT CGATCGCGGT CGCCGGTTTC
GCCCAACGCT TGCGCGGTGA TGCGTATCTC GCGGAGGGCT ATGACTGGGC CGCGATCCGC
GAAAGCGCTG CCGGGGCGAT CGGTGAAGAT GAATTCGGTT ACCGCGCCGA GTTTCTCGAC
CTGGTCGATC GGGCCCAGGC CGCCGATGAG GCTCGCGACA AACCCTGA
 
Protein sequence
MKRVLLIGTA GLAMLAACTA TVAPPTAPPP PPAPPPPPPA PFMSEMVTVT GSRVRSSAYR 
NDAIAGVANF TMPDQLVVDR ERYEDVDPNP VMSTADEPVS TFSIDVDTAS YSLVRNSLEA
GRLPPTDAVR IEEMVNYFDY DYALPPGPDE PFATHVTVTP TPWNADTQLM HIGIQGYEII
PDERPRANLV FLIDVSGSMN SPDKLPLAVQ AMHLLVDELH PDDTVALVVY ASASGVVLPP
TEARNAREIH RALDSLSAGG STAGGAGLAL AYDLAEQNFD EDAVNRVMLL TDGDFNVGVT
QDERLEDFVA RKRDSGIYLS VMGFGRGNYN DQMMQTIAQA GNGTAAYIDS RQEARRMLVE
ESFSSLFTIA NDVKIQVEFN PARVAEYRLI GYETRLLDRA DFNNDEVDAG EIGSGHSVTA
IYEIAAPGSE GVLMEPLRYG GATTVEPDLD GEYGFLRLRY KRPGEDESRL IERAVTDADT
VDDVGEASQE ARFSIAVAGF AQRLRGDAYL AEGYDWAAIR ESAAGAIGED EFGYRAEFLD
LVDRAQAADE ARDKP