Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_2200 |
Symbol | |
ID | 4252773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | - |
Start bp | 2627895 |
End bp | 2630174 |
Gene Length | 2280 bp |
Protein Length | 759 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 638118826 |
Product | vault protein inter-alpha-trypsin subunit |
Protein accession | YP_734330 |
Protein GI | 113970537 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCACAG GGAAAAGGGT AAAAGAAGTT TGTCAAACAC TGGCGATGCT AGTGATCGGC GCCTCCATTT GTTGGGGGCT ACCATTTGCG GTGTTAGCCT CACCGAACAG TGTCGGCACC TTATCGGCGC CACAAAATAT TGCCGCGACC TTCGATGAAC ACACAGCGAG CACATTGCCA GCTTTCAGTT TCGATGACGT CACCCAAGGG ATGTTGGTGT ATCAAACCGC CAGCGGCCGT TTAATGCCGT CCTTACCCGT GGACACGCAG GTGTCGATGC AGGTCTCGGG GTTAACCAAT AGGGTGAGCG TAAAGCAGGT TTTCAGTAAT CAAACGGGAT TTGTCCTCAA TGGCCGTTAT CTGTTCCCTT TACCCAATGA GGCGGCGGTG GACTCACTGC GTTTGCACAT TGGAGAGCGG ATCATCGAAG GTCAAATTCA CCCAAAGCAG CAAGCAAAAC AGATTTTTGA GCAGGCCAAA GCCGAAGGGA AACGCGCCAG TCTAGTCAGC CAAGAACGGC CCAATATGTT CACGACCGAA GTAGCCAACC TTGCTCCCGA TGAGGAGTTG GTGGTCGAAA TCAGCTATCA AGAAACCATT CATTATGAAG ATGGTCTCTT TAGCCTACGT TTTCCGCTGG TGGTGGCGCC GCGCTATATT CCTGGGTTAA CGCTAGGCGG AAACAATCAT GAGCGCGTGA CCAGCAGCCA AGTCTTCGAT GCTGACCGAA TTATTGCCCC GATCCGCGAT GCCAGTAGTG AGACGGATCC CGTGCTTAAG GCCGATATTA AGGTCAAGCT CGGTGAAGGC GTGGACAAAT CCGCCGTAGT GAGTCCGTAT CATCCCATTA CGATTGATGA AAAACAGGGC CAACTGACGG CGGCCTTGGC TAATCGCGTG CCCGCTAATC GTGACTTTGT GCTGCAATGG CGTCTTAAGC AGGGCACCAG TCCCGTGGCT TGGGTATTCA ACCAAGCGGG CAAAACCCAT ACGACGCAGG ACGATAATGC GAGTGCAGAT AGTGGTTCAA CTGCCAGTGT CGATGGGAAT AGCAATAGCA ACAATAACTA CAGCTTAGTG ATGGTCCTGC CGCCCAAAGT CGAGGCCAGT GAACAACCGA ACTTGCCCCG TGAGCTTATT TTAGTGATTG ATACATCGGG GTCGATGGCA GGAGATTCCA TCATTCAGGC AAAAAATGCC CTGCGTTATG CGCTGCGCGG TTTAAGGCCA CAGGATAGTT TTAACATTAT CGAATTTAAC TCAGATGTGT CTTTACTGTC GTCAACACCG CTGCCCGCGA CCGCGACTAA TCTCGCCATG GCACGTCAGT TTGTGAATCG TTTGCAGGCC GATGGTGGCA CCGAAATGGC GCAGGCTTTA AATTCCGCGC TGCCAAGACA AGCGTTTAAC ACAGCGTCGG GTGAGGATAA GTCGTTAAGA CAAGTGATCT TTATGACCGA TGGCTCAGTC GGTAATGAGT CGGCGTTGTT TGAGTTAATC CGCAATCAAA TCGGCGACAA CCGTCTTTTC ACCGTCGGCA TTGGCTCGGC GCCGAACTCG CATTTTATGC AGCGTGCGGC TGAACTTGGG CGCGGTACTT TTACTTACAT TGGTGATGTG GATGAAGTGG AGCAAAAGAT CAGCAAGTTA CTAGCCAAAA TCCAGTATCC GGTTCTGACC GATTTGCAGG TGCGCTTTGA CGATGGCTCT GTACCTGATT ACTGGCCCGC GCCTATACCC GATCTATATC GCGGTGAGCC GGTATTGATC AGCCTAAAGC GCCACCCACG TGAGCCTCAG GAACTGGTGA TCTCCGGTCG TCAGGGCCAT AAAAACTGGC AACAGTCGCT ATCTTTGCAA GCCAATGACG CAAGCCATTC TGCAATTGAT GTAGCGCAAC CCACGGCGGG ACTCGATTTG CTCTGGGCGC GCAAACAGAT TGCGGCCTTA GAGCTGAGTA AAAACGGCGC TAACGATGAC AAGGTCAAAC AACAGGTCAC GGCGCTGTCG CTCAACTATC ACTTAGTCAG CCCTTATACC AGCTTGGTCG CGGTGGATTT AACCCCCATC ACCAGCAATG CCATGAGCCG CGATGCTGTG GTGCGCCAGC ATTTACCTTT GGGGTGGCAG CCAATGGGTG TTTTGCCACA AACATCGACC TCGAGCCGTT TCGACATGCT CTTAGGGGGC AGCGTGTTAC TGCTTGCCTT GATGTTGGCG CTGTCGATTC GTCGCCAGCA GCGGCAGCAA CGCGCGCTGT CATTAGCGCT GAATAGTTGA
|
Protein sequence | MITGKRVKEV CQTLAMLVIG ASICWGLPFA VLASPNSVGT LSAPQNIAAT FDEHTASTLP AFSFDDVTQG MLVYQTASGR LMPSLPVDTQ VSMQVSGLTN RVSVKQVFSN QTGFVLNGRY LFPLPNEAAV DSLRLHIGER IIEGQIHPKQ QAKQIFEQAK AEGKRASLVS QERPNMFTTE VANLAPDEEL VVEISYQETI HYEDGLFSLR FPLVVAPRYI PGLTLGGNNH ERVTSSQVFD ADRIIAPIRD ASSETDPVLK ADIKVKLGEG VDKSAVVSPY HPITIDEKQG QLTAALANRV PANRDFVLQW RLKQGTSPVA WVFNQAGKTH TTQDDNASAD SGSTASVDGN SNSNNNYSLV MVLPPKVEAS EQPNLPRELI LVIDTSGSMA GDSIIQAKNA LRYALRGLRP QDSFNIIEFN SDVSLLSSTP LPATATNLAM ARQFVNRLQA DGGTEMAQAL NSALPRQAFN TASGEDKSLR QVIFMTDGSV GNESALFELI RNQIGDNRLF TVGIGSAPNS HFMQRAAELG RGTFTYIGDV DEVEQKISKL LAKIQYPVLT DLQVRFDDGS VPDYWPAPIP DLYRGEPVLI SLKRHPREPQ ELVISGRQGH KNWQQSLSLQ ANDASHSAID VAQPTAGLDL LWARKQIAAL ELSKNGANDD KVKQQVTALS LNYHLVSPYT SLVAVDLTPI TSNAMSRDAV VRQHLPLGWQ PMGVLPQTST SSRFDMLLGG SVLLLALMLA LSIRRQQRQQ RALSLALNS
|
| |