Gene Information |
Locus tag | Shewana3_1478 |
Symbol | |
ID | 4478115 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. ANA-3 |
Kingdom | Bacteria |
Replicon accession | NC_008577 |
Strand | + |
Start bp | 1718719 |
End bp | 1722066 |
Gene Length | 3348 bp |
Protein Length | 1115 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 639726048 |
Product | hypothetical protein |
Protein accession | YP_869118 |
Protein GI | 117919926 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3170] Tfp pilus assembly protein FimV |
TIGRFAM ID | [TIGR03504] FimV C-terminal domain [TIGR03505] FimV N-terminal domain |
|
|
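The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = cds_lengths(1718719, 1722066)
print(gene_len, protein_len)  # 3348 1115
```

The result matches the listed Gene Length (3348 bp) and Protein Length (1115 aa).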
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000313073 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.981736 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTTC GCACTTCGTA TCTTGTGGGC CTGATGGCCA CCGTTTTAGC TGTTTCTTCC ACGACCTCTT TTATTGATCC TGCGATAGCT GCTGATACTC CCAAAATAAA ACCTTTAAAA ATCATGGGGC CAGACGGACA GATCCGTCAG GTCAATCATC AATATGGACC AACGACCTCT AAGGACACTT TCTGGAGCAT TGCGCAAAAA GTCCGTCCCG ATGCCAGCGT CAGTGTTTAT CAGGTGATGG CCGCGGTATT TGATGCTAAC CCCCATGCGT TTAACTCCGA CAGTTATAAC AGCCTCGAAC GTGGCATGAT TTTGCTGATC CCTTCGAAAG AAGTGATGCT CGCCATTCCC AATAGTCTGG CCATTGCCCG CGCCGAACGC AGCGATAAGC AGGCCCGTTC TGGCGCTAAA GCCACGCCTA GGGTAGAAAC CAAACCTGCG ACTAAGCCAG CTGCACCGAC TCAGACCGTG GCACAGCCTA AACCTAAGGT GGATACGCCC CCTAAGGCCG AGGTGAAGCC TGAAGCGGCA ACAGTGACTA AAGCGGAAGT AGCTGCGACG CCAACCGCGC CTAAGATGGT GAGCTCGGAT ATCAATGCAC GCCTCGAAGC GGCGGAGAGC AAGAATTTAT CCCTGACTGA CGAGTTAGCC AGAGCGCAAG ATCAACTCGC GGTGAGAAAT ACCGATGTTG AGACACTCAA AGCTAAGGTT GAAGAATTAA ATCAACAGAT TGCGGTATTA GAAGAAACTC TGCAGGCCAG CAAGAAGCAA AACCTTGCCC TTAAAGCCGA GGTTGAGGCT GCACAAACGC CAGAGACCGC GACAACAGAC GAACCTGCCA TGCCAGCAGA GCCCGATGAT CTGTGGCGCA ACCTGATGAG TAATCCGCTA CTCCTCGTTG CGGCGGCGGT GATCCCCGCA TTATTGCTAC TGGTCCTCGT CTTCTGGCTA TTACGCCGTA AACGCAATAA AGAGCGCCGC GAGCTAGAAG TGCAGCAGGC GGCTATGATG GCCGCGGGTG CGGCTGGCGT GGCGAGTGCG TCTGTGCTGG CGATGGACGA TGATGATCTT AGCGACATGG CGGTGCATTT AGATACTGAC CATGCCGATT CCATCGACAG TTTACTCGAT GTCGGCAGTG TCGACCTGCA ACCCGAGCAG GAAATGACCG ATAGCCATGA ACAGATGGAC ATGGCATCAG AAATGTTTAT TGACCCAGGT GTTAGCACTG AACCTCAGTT TGAAGAGGAA GAAGGCCAGT CTCTAGACGA TCTCTGGGCT GAGGCCATGG GTGATCAAGA GGAAGCCGAA GCTTTTGCCG AAGAAAAAGC CGCATCCGGC AATATTCTGG AGGAAGAAGA TCTCGATGCA CTGCTGGCTG GATTAGGCTC TGATGAAGAA GATACCCCGC AGGAAGCTCA AGCCACTGCT GCCGGGCTTG ACGAACTCGA CGAATTAGCC GGTTTCGATT TACCTGCTGA ACCAGAGCCC GCCGCTGAGG ATGAAGATTT AGCCGCCGCG ATTGCCGCCG AACTCGATTC TGAGCTTGAA AACGACGCAG CGCCAGCCGA CGATGATATC GATGCGTTAC TCGCAGGCTT TAACCAGCCC GAGGTTAACG AGCCTGCCAC AGCGGCCGAT GAGGATCTTG ATGCCTTACT GACCGATGAT GCTCAGGCAC CAGCAGAAGC AGCTGACGAT CTCGGCGATG AAATTGCCGC CGAGTTGGAT GATGACTTAG CCGAGTTAGG CGTCGCGCAA ACCGATGATA TTGATGCTTT ACTGGCAGAT 
TTTGATAAAC CCGCCGCGCC CGAACCCGAT TTAAGCGATG AAATTGCCGC CGAGCTCGAT GATGACTTAA CTGATATAGG CGTAAGCGAA TCCGATGATA TCGATGCGCT TCTTGCCAGC TTTGATGCTC CCGCGCAGCC CGATCCTGAA GTCGCCGCGG CGATTGCTGC CGAGTTGGAT GATGAACTCG CTCCCGACTT ACCAACGAGC GATGAAGACC TCGACCAGTT ATTGGCAGGC TTTAACACGC AAGAAGCGCT CGTCGAAGAG GCTGCTGCCG ACGTTACTGG AGCCGACGAT GCTGAGGCCG AGGATATGGC TCAAGTTGAA GCCGCCCTTG AGCGCGATGA CATCGCCACC GAACTCGAAG CCGCATTACC TGAAGACCTG TCGATGGCTG ATGACGACCT CGATTCACTC TTAGCCGAGT TCGATGTGCC AGAACAAGCC GAGGCTGATG CCGACGACTT TAACTTCGAC TTAGAAACCA CGCATGTCGA GCCAGAAGAA AAAGACGAGG AAGAAGATTT ACTCAAGGGC ATTGGTGCCG CCCATGCCAT TGATGGCACA GACGATACTT TCGAGCTGAG CGAAGAGATT GCGGATAACC AAGACAGTGC GGATAACGCT GCCGCAGAAC CTAAGTTTAC AGGCTTTGAT GAAGATACTG ACGATATCGC GCCGGCACTG GCGGGTGCCG CTGTCGCCGC ATCAGTAGCG GCTGCAGCAA GTGCGAAGGA AAAAGAGTCA TCTTTCTTCA ATGACTTAAA GGCCAATAAG ACTAAAGATC CCCATGTGCT CGATTGGGAT ACCGAACTGA ATTTTGCGCC TACGCCTAAG GCGGAAACGC TGGTCAAACC GGAAACCGAT TTGCCTTTAG ATCCTGTACC AGTGCAGGGC ACTAAACGTC AACCGCTGGA GTTTGCGCCA GATGCGAAAC CAGAGGTTGA TGCCAAAAAT ACAGCTGCGA GCACTGAGTT TAACCTCGAC GATGAGCTGG ATTTGGGCCG CGAGTTTGAT TTAAAAGACG ATACCGATGA TGTCGATTTA AGCGATGACA GCGTGCTTGC CGCCTTTAGC ACCAATACCG AACCCGAAGA GGAAGAGGTA TTAACGCCTG AGGATAGCTT TGCCCTCGAT GATGAACACA CGCTCACCGT CGATGAGGCC TTAGCCGCCC TCGATGCGCA GGAATCCCGC AAGTCGAAGA AGTTTGTTCC AGAGCATGAT TTAACTAACT TCCAAAACGA GCATGGTTAT ATCGATATCG AGAAACTGCT TAACGATGCT GAGCAAGACA ATACTGAGAC AGATCTTTAT CCCGAAATGG ATGTTGAGAT GAGTGATGTG GGCGCGCTGA TTGGCGATGC TGCCATGATC GACGTGGACG ATGAGGAAAA TTCAGTCAAT GCCAAGCTCG ATTTAGCCCG TGCCTATATC GAAATCGATG ACAGTGACAG TGCTAAAGCA TTGCTACGGG AAGTCCAAAT CGACGGTAAC GAGCGTCAGC AAGAAGAAGC GGCCCGTTTA CTTAAAGAGA TGGGTTAA
|
Protein sequence | MKFRTSYLVG LMATVLAVSS TTSFIDPAIA ADTPKIKPLK IMGPDGQIRQ VNHQYGPTTS KDTFWSIAQK VRPDASVSVY QVMAAVFDAN PHAFNSDSYN SLERGMILLI PSKEVMLAIP NSLAIARAER SDKQARSGAK ATPRVETKPA TKPAAPTQTV AQPKPKVDTP PKAEVKPEAA TVTKAEVAAT PTAPKMVSSD INARLEAAES KNLSLTDELA RAQDQLAVRN TDVETLKAKV EELNQQIAVL EETLQASKKQ NLALKAEVEA AQTPETATTD EPAMPAEPDD LWRNLMSNPL LLVAAAVIPA LLLLVLVFWL LRRKRNKERR ELEVQQAAMM AAGAAGVASA SVLAMDDDDL SDMAVHLDTD HADSIDSLLD VGSVDLQPEQ EMTDSHEQMD MASEMFIDPG VSTEPQFEEE EGQSLDDLWA EAMGDQEEAE AFAEEKAASG NILEEEDLDA LLAGLGSDEE DTPQEAQATA AGLDELDELA GFDLPAEPEP AAEDEDLAAA IAAELDSELE NDAAPADDDI DALLAGFNQP EVNEPATAAD EDLDALLTDD AQAPAEAADD LGDEIAAELD DDLAELGVAQ TDDIDALLAD FDKPAAPEPD LSDEIAAELD DDLTDIGVSE SDDIDALLAS FDAPAQPDPE VAAAIAAELD DELAPDLPTS DEDLDQLLAG FNTQEALVEE AAADVTGADD AEAEDMAQVE AALERDDIAT ELEAALPEDL SMADDDLDSL LAEFDVPEQA EADADDFNFD LETTHVEPEE KDEEEDLLKG IGAAHAIDGT DDTFELSEEI ADNQDSADNA AAEPKFTGFD EDTDDIAPAL AGAAVAASVA AAASAKEKES SFFNDLKANK TKDPHVLDWD TELNFAPTPK AETLVKPETD LPLDPVPVQG TKRQPLEFAP DAKPEVDAKN TAASTEFNLD DELDLGREFD LKDDTDDVDL SDDSVLAAFS TNTEPEEEEV LTPEDSFALD DEHTLTVDEA LAALDAQESR KSKKFVPEHD LTNFQNEHGY IDIEKLLNDA EQDNTETDLY PEMDVEMSDV GALIGDAAMI DVDDEENSVN AKLDLARAYI EIDDSDSAKA LLREVQIDGN ERQQEEAARL LKEMG
|
| |
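As a rough illustration of how the protein sequence follows from the gene sequence, the sketch below translates DNA codon by codon. It is a minimal example, not the database's own pipeline: the names `CODON_TABLE` and `translate` are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons, which does not matter here because this gene starts with ATG).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a plus-strand CDS, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAAATTTC GCACTTCGTA TCTTGTGGGC"))  # MKFRTSYLVG
```

The output agrees with the first ten residues of the listed protein sequence. Passing the full gene sequence should reproduce the whole 1115-residue protein under the same assumptions.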