Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_0721 |
Symbol | |
ID | 4251757 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | - |
Start bp | 840521 |
End bp | 842029 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 638117284 |
Product | curlin-associated protein |
Protein accession | YP_732858 |
Protein GI | 113969065 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000376706 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATCAC AAGCGAAAAA ATCACTCATT GCGCTAGCTA TTGCAACTGG CTTAAGTGGC CAAGCATTTG CTGCCAGTCT GATTAACGAC ATCAGCGTTG AACAAACTGG CCAAGGTCAA GACACCTTAG TGGCGCAAAC AGGGTTAATT AACGCTGCAG CAGTTACTCA AACGGGTAAC GACCAAGTTG CCACCGTATT GCAAGACGGT GTGTGGCATG AAGCGCAAGT TAACTCAACG GGTGATGCCA ACCAAGTCAC TGTGACTCAA CAAACGGATT GGCATGTTGC GTCAGTTAAC GTCACTGGCA ATAACAACGA AGCTGAAGTA GCGCAGGATG GTTTCTTCAA CCAAAGCAGC AACGACATAG CTGGTAACGA CAACCTAGTT TCTGTGAACC AATTAGGTGA AGTGAACGAA AGCTACGTTG AAATCACTGG CAACGAAAAC AGCGCCTTCG TAGAACAAGA AGGTGATGCT AACCTCGCAG TATTCCGCGT TCAAGGCGAT AACAACGATG GCGACATCAA GCAATACGGT AGCAACAACC AAGCGGGTTT AATCGCACTC GACCTGACCG CTAACGTGGG CAATAACAAC GACGTATCGG TTGAACAGAT TGGTAACAAC AACTTTGGTG CGGCTAAAGG CATCGCGGGT AACGACAACA GCGTCGATAT CTATCAAAAG GGTGACAGCC ACACTGGTTT CGTTTACGCC TTAGCAGGTA GCGAAAACGA CATCACCATG AAACAAGAAG GCAGCAGCAA CACTGCTTAC CTGTCAATGA CCACAGGCGA TGACAACAGC ATCGATATCG CCCAAGACGG CAACCGCAAT ACTGTAGGCG ATACTTTAGT TGCCGACATC CAAGGTAACG ACAACGATAT CACTATCAAG CAACGTGGCG ACAGCAACGG TGCAGAGTTC CAAGTATGGG GCGATAGCAA CGACGTTGAC TTAAAACAAC GTGGTGATGC TAACTTCGCG ACCTTTGGTG CTTACGGTAC TGACAACGAT TTCGACTTAT CTTCTAAGGG TGATAACAAC GAGCTGGTTG CCTTCGCAAC AGGTGAAGAC AACAGCATCG AAATCAGCCA AGAAGGTGAT ACGAACTTCG CTTATGTGGA TGCAGTGGGT AACGACAACG AAGTGGATGT TGAGCAAGAT GGTGATCAAA ATGAGACTAT CATCAGTGTA ACGGGTAACA ACAACGCCGA TGTGACTGCA CTGCAACACC GTGGCGATCT GAACCTTATC GATTTAATCA TCGAAGGTGA TGAAAACTCA GCTCAAATCA CTCAAGCGGG TAACGGTAAC TGGGTGGGTG GCGATAGCGG CACTTCATTC GCATCAAACT CATTCGGTGT GCGTGGTGAT AACAACAGCC TAATGATCAC CCAAACTGGT AATGACAACT TAGTTGTCGG TTCGCAAGCA GGCAACAGCA ACAGCATCAG CGTTAACCAA ACCGGTGATA TGAACGTTGC GACCGTTGTT CAGTACTAA
|
Protein sequence | MKSQAKKSLI ALAIATGLSG QAFAASLIND ISVEQTGQGQ DTLVAQTGLI NAAAVTQTGN DQVATVLQDG VWHEAQVNST GDANQVTVTQ QTDWHVASVN VTGNNNEAEV AQDGFFNQSS NDIAGNDNLV SVNQLGEVNE SYVEITGNEN SAFVEQEGDA NLAVFRVQGD NNDGDIKQYG SNNQAGLIAL DLTANVGNNN DVSVEQIGNN NFGAAKGIAG NDNSVDIYQK GDSHTGFVYA LAGSENDITM KQEGSSNTAY LSMTTGDDNS IDIAQDGNRN TVGDTLVADI QGNDNDITIK QRGDSNGAEF QVWGDSNDVD LKQRGDANFA TFGAYGTDND FDLSSKGDNN ELVAFATGED NSIEISQEGD TNFAYVDAVG NDNEVDVEQD GDQNETIISV TGNNNADVTA LQHRGDLNLI DLIIEGDENS AQITQAGNGN WVGGDSGTSF ASNSFGVRGD NNSLMITQTG NDNLVVGSQA GNSNSISVNQ TGDMNVATVV QY
|
| |