Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sama_2151 |
Symbol | |
ID | 4604401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella amazonensis SB2B |
Kingdom | Bacteria |
Replicon accession | NC_008700 |
Strand | - |
Start bp | 2597618 |
End bp | 2600431 |
Gene Length | 2814 bp |
Protein Length | 937 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639781536 |
Product | hypothetical protein |
Protein accession | YP_928026 |
Protein GI | 119775286 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3170] Tfp pilus assembly protein FimV |
TIGRFAM ID | [TIGR03504] FimV C-terminal domain [TIGR03505] FimV N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00179894 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTTTC GTACCTCCTA TCTCGTGGGC ATTCTGGCCT CTGTCCTGGC CGTATCCACC ATTCCCCAGT TCTCCGAAGT CAGAGCCGCT GAGCCGCTCA AAATCACAGG TCCGGACGGT CAAAGTCGTG AAACCCAGTC GCGCCAGTAT GGTCCTACTA CGGTGCAGGA TACCTTCTGG AGCATTGCCC AAAAAGTTCG CCCGGATAAC AGTGTCAGTG TTTATCAGGT AATGGCGGCG ATTTATGATG CCAACCCCCA TGCCTTCAGC AGTGCCAATT ACAATACCCT CGAAAAGGGC ATGATACTGC TCATTCCGTC CAAGGAAGTA ATGCTGGCCA TTCCCAAAAA TCTGGCTCAG CAAAGAGCAC TGGAGCAAGA CAGGGCATGG CGCACCGGCA AGGTTGAACC AAAGCAGGCT ACAGCCAAGC CCGTCACAGC TAAACCTGCT GCGCAGGAAG CGCCCAAAGC CGTTACCGAA CCTGCTAAAG TTGAGGCGAA ACCCGCTCCC GTACCTGTTG TAGATCCCAA GCTGGTTGAA GAGAACCAGG CGCTGTCGGC CGAGATAAGT CGTCTGACCG AAGAGCTCAA TGTACGCAAA AATGATGTTG AGTCGCTGAG CGCCCAGGTT GAGCAGTTGA CCGAAGAAGT GCAAACCCTG AAAGCCAACC TGGCCGCGAC AACGGAAGAG TCAGAAGCAC TGAGGACTGA AAATAAAACG CTGAAGCAGG CACAGGCACT GGCTGAGGTT GAAGCCAATA AGCCCGGCGA ACTGTGGCGC AGTCTCATGG ACAGCCCCCT TATGCTTATC CTGGGTGCCG TTGTGCCCGC ACTCCTTATT TTGATGGCGG TATGGCTGTT CCTGCGCCGT CGCCGTGGCG GAGCTGATGG CGATGTGGGT GAGACAGCCA AATCTGCTGA TGCGCCCCTG GCAGCGGCCG CTGCCGCGGG GGCTGTTGCG GTGGCAGCCA GCGCCGACGA GGAAGAAATG GCCATCCGTT TGGATCCAGA AGAAGAGGAG TCTCTGGATT CGCTGCTGGA CATGGGCAAT GTTGATCTGG CGCCTGAAAC CTTACCTGAA GAAACGTCTG CAGATCTTTT TGTCGAGCAA GCTGATGATT TTGCCGATGA TGGCCAGTCT CTGGATGATC TCTGGGCCGA GGCGATGGGC GAGCAGGACG AAGGTGAGTT GAACGCAGAG GCGGAAGCTG ATTTGGACGC CTTGATGGCG GGCTTTGAAG AAGAGACCCC TGACGTCGCG AAAGAAGAAG ATTTGGATGC GCTGCTGGCT GGATTTGACG AGCCAGAGCA GACCATCACT ACGACTGAAG AGTCCCAGCC TGAGAAAGAT GAATTGAGTG ATGCCATAGC GGCTGAGCTC GAGACAGAGG TTGCGGATGA CACTGTCAGC GAAGACGACC TCGATGCACT GCTGGCGGGC TTTAATGAGC CGGTTGAAGC CATGGCAGAG TCTGTGGCTG ATAGCGATGA GGCTCAAAGT GATGATCTGG CAGAGGCCAT TGCCGCCGAA CTTGAGTCTG AACTGGCGGG CAATGATGAG GTGAGCAGCG ATGAGGATTT GGACGCGCTG TTAGCGGGGT TTGATGCCCC ATCAGCAAGC GACGATGGCG AGGATGACCT TAAAACGCCG TCAACGGAAA CCCTCGAAGC GGACTGGTCA GAGGCCATTG CTGCAGAGCT CGAAGGCGAA GTCGCGGCCG AAGCAGAAGC AGAAGAGGAT CTTGATGCGC TGCTGGCAAG CTTCGCGACT GAAAGCGAGT CTGAAGCATC AAACGGCCTG GAAAACTCCG CAGATGACGT GTTGGCTGCC GATGAAACAA CCAGCCCGGT TGATGATGCC TTTGCCAACG TGCTTGAAAG TGAGCTTTCT TTAGCCGACG AAAACGCTGA CGAAGACCTG AACGCACTGC TGACTGACTT AGACAAAGGC GATGAAGCCC CGATTGAGGA TATTACTGCG GGTCTTGGTG ATGAGCTGCC GCTCAGCGAT GAAGATGACG ATCTTGATTC TTTACTGGCA GGTTTTGATA AAGAAGAAGA GACTGGGCTT GAAGACGCTG CGTTAGCGGC AGCTGCGGTA GCCACGGCGG CCTCAATCAG TGCGGCCACA TCAAATAAAG AGTCCGAGAA AGCATCAGAG AAAGACAAGA CCGCTGAGAA AGGTGCGCAT GATGCCGATC CTGTGGCCGT GTCAGCCAAA GATTCAGGCT TCTTTGATGA TCTTAAGGGT AAAAAAGGCG CTGCCGGCAG CAATATGCTG GAGTGGGAAA GTATCGCTCC ACCGGATACC AAAAAGCCTT CATTGGATGT GAGTGATGAT GAGCTGCTGT CGGCCTTTGC TGCAGAACAT GATGACTCAG ATGACGATGC CTTTGTGCTG GATGTGGATG CCGACCACAG CATGACAGTG GATGAAGCGC TGGCCGCACT CGACGCCAGT GAAAAGTCCA AGCGGCAGAG TAAAGCAGAA ATAGATGCTG ATCTCAGTAA CTTCCAAAAA GAGAATGGCT TTATTGATAT CGATAAGCTG CTCAATGATG CAGATGATAC CGAACCAGAA CCTTATCGCG AGTTGGATAT GGATATAGGC GAAGTTGAAA GCCTGATTGG CGGTGCCGCC ATGGTCGACG TGGACGATGA GGAAAATTCG GTCAATGCCA AACTGGATTT GGCCCGTGCC TACATCGAAA TTGACGACAA AGACAGCGCC AAAGCCCTGC TCAAAGAAGT GGAAATGGAC GGTAACGAGC GTCAAAAACA GGAAGCGGCC AATCTGTTGC AAGACATAGG CTAA
|
Protein sequence | MNFRTSYLVG ILASVLAVST IPQFSEVRAA EPLKITGPDG QSRETQSRQY GPTTVQDTFW SIAQKVRPDN SVSVYQVMAA IYDANPHAFS SANYNTLEKG MILLIPSKEV MLAIPKNLAQ QRALEQDRAW RTGKVEPKQA TAKPVTAKPA AQEAPKAVTE PAKVEAKPAP VPVVDPKLVE ENQALSAEIS RLTEELNVRK NDVESLSAQV EQLTEEVQTL KANLAATTEE SEALRTENKT LKQAQALAEV EANKPGELWR SLMDSPLMLI LGAVVPALLI LMAVWLFLRR RRGGADGDVG ETAKSADAPL AAAAAAGAVA VAASADEEEM AIRLDPEEEE SLDSLLDMGN VDLAPETLPE ETSADLFVEQ ADDFADDGQS LDDLWAEAMG EQDEGELNAE AEADLDALMA GFEEETPDVA KEEDLDALLA GFDEPEQTIT TTEESQPEKD ELSDAIAAEL ETEVADDTVS EDDLDALLAG FNEPVEAMAE SVADSDEAQS DDLAEAIAAE LESELAGNDE VSSDEDLDAL LAGFDAPSAS DDGEDDLKTP STETLEADWS EAIAAELEGE VAAEAEAEED LDALLASFAT ESESEASNGL ENSADDVLAA DETTSPVDDA FANVLESELS LADENADEDL NALLTDLDKG DEAPIEDITA GLGDELPLSD EDDDLDSLLA GFDKEEETGL EDAALAAAAV ATAASISAAT SNKESEKASE KDKTAEKGAH DADPVAVSAK DSGFFDDLKG KKGAAGSNML EWESIAPPDT KKPSLDVSDD ELLSAFAAEH DDSDDDAFVL DVDADHSMTV DEALAALDAS EKSKRQSKAE IDADLSNFQK ENGFIDIDKL LNDADDTEPE PYRELDMDIG EVESLIGGAA MVDVDDEENS VNAKLDLARA YIEIDDKDSA KALLKEVEMD GNERQKQEAA NLLQDIG
|
| |