Gene Shewmr4_2200 details

Gene Information

Locus tag: Shewmr4_2200
Symbol:
ID: 4252773
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 2627895
End bp: 2630174
Gene Length: 2280 bp
Protein Length: 759 aa
Translation table: 11
GC content: 52%
IMG OID: 638118826
Product: vault protein inter-alpha-trypsin subunit
Protein accession: YP_734330
Protein GI: 113970537
COG category: [R] General function prediction only
COG ID: [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain
TIGRFAM ID: [TIGR01167] LPXTG-motif cell wall anchor domain
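
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene ends in a single stop codon; the function names span_length and coding_codons are hypothetical helpers introduced for illustration, not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2627895
END_BP = 2630174
GENE_LENGTH_BP = 2280
PROTEIN_LENGTH_AA = 759


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions the record is internally consistent: 2630174 - 2627895 + 1 = 2280 bp, and 2280 / 3 - 1 = 759 aa.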


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCACAG GGAAAAGGGT AAAAGAAGTT TGTCAAACAC TGGCGATGCT AGTGATCGGC 
GCCTCCATTT GTTGGGGGCT ACCATTTGCG GTGTTAGCCT CACCGAACAG TGTCGGCACC
TTATCGGCGC CACAAAATAT TGCCGCGACC TTCGATGAAC ACACAGCGAG CACATTGCCA
GCTTTCAGTT TCGATGACGT CACCCAAGGG ATGTTGGTGT ATCAAACCGC CAGCGGCCGT
TTAATGCCGT CCTTACCCGT GGACACGCAG GTGTCGATGC AGGTCTCGGG GTTAACCAAT
AGGGTGAGCG TAAAGCAGGT TTTCAGTAAT CAAACGGGAT TTGTCCTCAA TGGCCGTTAT
CTGTTCCCTT TACCCAATGA GGCGGCGGTG GACTCACTGC GTTTGCACAT TGGAGAGCGG
ATCATCGAAG GTCAAATTCA CCCAAAGCAG CAAGCAAAAC AGATTTTTGA GCAGGCCAAA
GCCGAAGGGA AACGCGCCAG TCTAGTCAGC CAAGAACGGC CCAATATGTT CACGACCGAA
GTAGCCAACC TTGCTCCCGA TGAGGAGTTG GTGGTCGAAA TCAGCTATCA AGAAACCATT
CATTATGAAG ATGGTCTCTT TAGCCTACGT TTTCCGCTGG TGGTGGCGCC GCGCTATATT
CCTGGGTTAA CGCTAGGCGG AAACAATCAT GAGCGCGTGA CCAGCAGCCA AGTCTTCGAT
GCTGACCGAA TTATTGCCCC GATCCGCGAT GCCAGTAGTG AGACGGATCC CGTGCTTAAG
GCCGATATTA AGGTCAAGCT CGGTGAAGGC GTGGACAAAT CCGCCGTAGT GAGTCCGTAT
CATCCCATTA CGATTGATGA AAAACAGGGC CAACTGACGG CGGCCTTGGC TAATCGCGTG
CCCGCTAATC GTGACTTTGT GCTGCAATGG CGTCTTAAGC AGGGCACCAG TCCCGTGGCT
TGGGTATTCA ACCAAGCGGG CAAAACCCAT ACGACGCAGG ACGATAATGC GAGTGCAGAT
AGTGGTTCAA CTGCCAGTGT CGATGGGAAT AGCAATAGCA ACAATAACTA CAGCTTAGTG
ATGGTCCTGC CGCCCAAAGT CGAGGCCAGT GAACAACCGA ACTTGCCCCG TGAGCTTATT
TTAGTGATTG ATACATCGGG GTCGATGGCA GGAGATTCCA TCATTCAGGC AAAAAATGCC
CTGCGTTATG CGCTGCGCGG TTTAAGGCCA CAGGATAGTT TTAACATTAT CGAATTTAAC
TCAGATGTGT CTTTACTGTC GTCAACACCG CTGCCCGCGA CCGCGACTAA TCTCGCCATG
GCACGTCAGT TTGTGAATCG TTTGCAGGCC GATGGTGGCA CCGAAATGGC GCAGGCTTTA
AATTCCGCGC TGCCAAGACA AGCGTTTAAC ACAGCGTCGG GTGAGGATAA GTCGTTAAGA
CAAGTGATCT TTATGACCGA TGGCTCAGTC GGTAATGAGT CGGCGTTGTT TGAGTTAATC
CGCAATCAAA TCGGCGACAA CCGTCTTTTC ACCGTCGGCA TTGGCTCGGC GCCGAACTCG
CATTTTATGC AGCGTGCGGC TGAACTTGGG CGCGGTACTT TTACTTACAT TGGTGATGTG
GATGAAGTGG AGCAAAAGAT CAGCAAGTTA CTAGCCAAAA TCCAGTATCC GGTTCTGACC
GATTTGCAGG TGCGCTTTGA CGATGGCTCT GTACCTGATT ACTGGCCCGC GCCTATACCC
GATCTATATC GCGGTGAGCC GGTATTGATC AGCCTAAAGC GCCACCCACG TGAGCCTCAG
GAACTGGTGA TCTCCGGTCG TCAGGGCCAT AAAAACTGGC AACAGTCGCT ATCTTTGCAA
GCCAATGACG CAAGCCATTC TGCAATTGAT GTAGCGCAAC CCACGGCGGG ACTCGATTTG
CTCTGGGCGC GCAAACAGAT TGCGGCCTTA GAGCTGAGTA AAAACGGCGC TAACGATGAC
AAGGTCAAAC AACAGGTCAC GGCGCTGTCG CTCAACTATC ACTTAGTCAG CCCTTATACC
AGCTTGGTCG CGGTGGATTT AACCCCCATC ACCAGCAATG CCATGAGCCG CGATGCTGTG
GTGCGCCAGC ATTTACCTTT GGGGTGGCAG CCAATGGGTG TTTTGCCACA AACATCGACC
TCGAGCCGTT TCGACATGCT CTTAGGGGGC AGCGTGTTAC TGCTTGCCTT GATGTTGGCG
CTGTCGATTC GTCGCCAGCA GCGGCAGCAA CGCGCGCTGT CATTAGCGCT GAATAGTTGA
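
A GC content figure such as the one reported for this gene can be recomputed from the sequence text. This is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases; gc_percent is a hypothetical helper, and the blank-separated 10-base blocks used in the listing are stripped before counting.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First row of the gene sequence listing, used here only as sample input.
first_row = "ATGATCACAG GGAAAAGGGT AAAAGAAGTT TGTCAAACAC TGGCGATGCT AGTGATCGGC"
print(f"{gc_percent(first_row):.1f}%")
```

Passing the full gene sequence instead of the first row gives the value to compare with the reported GC content.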
 
Protein sequence
MITGKRVKEV CQTLAMLVIG ASICWGLPFA VLASPNSVGT LSAPQNIAAT FDEHTASTLP 
AFSFDDVTQG MLVYQTASGR LMPSLPVDTQ VSMQVSGLTN RVSVKQVFSN QTGFVLNGRY
LFPLPNEAAV DSLRLHIGER IIEGQIHPKQ QAKQIFEQAK AEGKRASLVS QERPNMFTTE
VANLAPDEEL VVEISYQETI HYEDGLFSLR FPLVVAPRYI PGLTLGGNNH ERVTSSQVFD
ADRIIAPIRD ASSETDPVLK ADIKVKLGEG VDKSAVVSPY HPITIDEKQG QLTAALANRV
PANRDFVLQW RLKQGTSPVA WVFNQAGKTH TTQDDNASAD SGSTASVDGN SNSNNNYSLV
MVLPPKVEAS EQPNLPRELI LVIDTSGSMA GDSIIQAKNA LRYALRGLRP QDSFNIIEFN
SDVSLLSSTP LPATATNLAM ARQFVNRLQA DGGTEMAQAL NSALPRQAFN TASGEDKSLR
QVIFMTDGSV GNESALFELI RNQIGDNRLF TVGIGSAPNS HFMQRAAELG RGTFTYIGDV
DEVEQKISKL LAKIQYPVLT DLQVRFDDGS VPDYWPAPIP DLYRGEPVLI SLKRHPREPQ
ELVISGRQGH KNWQQSLSLQ ANDASHSAID VAQPTAGLDL LWARKQIAAL ELSKNGANDD
KVKQQVTALS LNYHLVSPYT SLVAVDLTPI TSNAMSRDAV VRQHLPLGWQ PMGVLPQTST
SSRFDMLLGG SVLLLALMLA LSIRRQQRQQ RALSLALNS
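
The protein sequence can be derived from the gene sequence by codon-by-codon translation. The sketch below assumes reading frame 1 on the sequence as listed and uses the amino-acid assignments of translation table 11, which are the same as those of the standard genetic code. Table 11 also permits alternative start codons, which this sketch does not handle; that is not needed here because the gene starts with ATG. The names CODON_TABLE and translate are hypothetical and introduced only for illustration.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate frame 1 of a DNA string, stopping at the first stop codon."""
    cleaned = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listing.
print(translate("ATGATCACAG GGAAAAGGGT AAAAGAAGTT"))  # MITGKRVKEV
```

The output matches the first ten residues of the protein sequence listing. Running translate on the full gene sequence gives a string to compare with the listed protein once the spacing is removed from both.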