Gene Sama_1009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_1009 
Symbol 
ID4603261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp1220995 
End bp1224441 
Gene Length3447 bp 
Protein Length1148 aa 
Translation table11 
GC content54% 
IMG OID639780348 
Producttype IV pilin biogenesis protein, putative 
Protein accessionYP_926886 
Protein GI119774146 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3419] Tfp pilus assembly protein, tip-associated adhesin PilY1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTGCC TCTTGTGGAC GCCCCTGTAT GCGGATGACA CGTCTTTGTA TGTATATGAG 
TCCTCCAATC GCTCGGATGA GCGACCACAG GTGCTGGTTA TCTTTGATAA CTCAGGCAGC
ATGGATACCA CTGTATATGG TGTGAATCCA TCGTTTTCCA GTGCAGACGG GGACCTGTCA
GAGAGTGAAC AAGTCTATTA CAGCCTGGAT GCGGCAGAAG CGCCGCCCAA TCCGGCAAAC
CCCGCTGAAA AGCGCTTCTT TACCTTCAAA CGCAATGCCT GTGCCAGCTC TTTTGAATTC
CTTAAAGAGC AGGGTGTTTT CACGGGTTTT ATGCGCCACT ATGTTTACAG TGGGCAAACC
GGTAGCTGGG AGGAGTTTCC CCGCAGTGAC GGCGCCAGCA TTCGCATGGT GGAGTGTTTT
GAAGATATTC AGGATAAAAA TTACGACAAC GGCAGTGTCG CCAAAGATGG TCTTCCGGTA
GACGGCGAGG GTCGCCGGGG CAGTCCTTCT CCCTTTTTTC GGGTCTCCAG CGGCAGCAAG
GAGGCAACCA AAGAGCTGGC GATGTCCAAG GCAAAAAATA CCGGATTTGG TACCGGTAGA
GTCGTCACTC TGTACACAAA AACTTACCTT ACCTGGTATC ACAGCAAGAA AAAGCAGGTC
AATCGAACCC GTATCGATAT CGCCAAGGAA GCGGTGACTA ATGTGCTGTT AACTACGCCG
GGTGTGGATT TTGGTTTGGC CATTTTTAAC AGTAACGTTT ACGAAGGCTA TGACGACGGT
GGACGTATTA TCGCTGGCAT TAAACCTGCC ACCGCCAGCA ATAAAAAGGA TTTGATTGAA
GCCGTCGATT TACTGAAAGG GACAACCTGG ACCCCGCTGT GTGAAACCCT CTACGAGGCA
TACCGGTATT TTTCCGGCGG TGAAGTTTGG TTTGGTGATG ACGACCCTAC CCTGAAACCC
TATCGCGACA AAGACGCCAT TGATGCAAAT AGCCGCTATA AGTCGCCATT CAAGACTTGC
CAGAATCGCG CTTATATCGT CTATGTGACT GATGGTGAGC CCACCCGCGA CAGTAACGCC
AATCAGCTGG TGTATGACTT AACCGGCGGC GTTGATGCCT ATACCTCCAG CCCTGCCAGT
TATTTAAGCT CGCTCTCTTC CTGGATGAAC ACCCAAGATG TCAACCCCAA TATGACGGGG
AAGCAGAGTG TATCTACTTA CACCATTGGT TTCAGTCAGG GCGCAGCGTC GGCGGCTGGC
CTGCTGCGTC ATACTGCCGA AAAGGGCGGT GGCAAGTATT ATGATGCAAC CAACGTGGAT
GACTTGCAGA AATCACTGAT GCAGGTGTTT AAAAACATTC TCGAAAAGAA TGCCAGCTTC
ACCGCCCCCG CCGTGGCCAG CAACAACTTC AACCGTATCC GGACCTTCGA TTCCGTTTAC
TACTCGATGT TTTTGCCAAA CCGGGGGCCA CGTTGGAGCG GGAATCTCAA AAAATTCAAG
GTCACCGACA GTGGCGACAT CATAGATGCC AGCAAGGCCA AGGTCATTGA CGGCGATGGC
AATATCGCCA AGACCGCTTG CTCCCATTGG AGCAGCAGCG CTGATTGCAG CGCGGGTGAC
GGTAATGATG TGCGCCGTGG CGGTGCTGCC GGTATGTTGC AGCGAATGCA GGCCAGGGAC
CGCAATCTGT TGTCGGACGT CGGCGGGCTG AAGCCGCTGA CGCTGAGCGC CGCCAGAACC
AAGGCCGGTG GTGACGGTGC CCTTGCCACC TTGCTCGGGG TGGAGGAGTC AGAAGTCGCG
AGCCTGATTG ACTGGGCCCG AGGTGTGGAC GTGGATGACG ACAATGACAA CGGCGATCGC
ACCGAGATGC GCGCCGATGT TATGGGCGAC CCCCTGCACT CCAAACCGCT TGCCATCAAC
TTTGGCAGCG AGGGTTCACC GGATATTCGG GTGATAGTGG GCACCAACCA CGGTGTGCTG
CACATGTTTA AAGATGAAGG CACCAGTGTG TCTGAATCCT GGGCTTATCT GCCGTGGGAA
ATGCTGCCAA AGCAGGCCAC GCTGCGGGAA AACCTGCCGT CGGGTCGCCA CTCTGTGTAC
GGCATTGATG GCTCACCCGT CGCCTGGGTG AAAAACGGCG CTTCCGGTAT TCAAAAGGCC
TGGCTGTTTT TCGGTCTTCG CCGCGGTGGC GATGCCTACT ATGCCCTGGA TATCACCAAT
CCGGATGCGC CACGTTTTAT GTGGCGTATT GATGGCAATA GCCCGGGAAT GGATTTGCTC
GGCCAAAGTT GGTCCAAGCC TGTAGTGACC TTTATCCCGG GCAGAGAGTC CAGCCCGGTT
CTGATTTTGG GTGGTGGCTA CAGCCCGTCA AACAAGGATA TCCCCGGCGT GGGTACACCG
GATAATTTGG GTACGGCGGT GTTTATTGTG GATGCCGCGA CCGGCGCTCT GGTGCACGCC
TTTGGCCCCA ATAATGCCGG TAACATGACT GTGATGCCCG GCATCAAAGA CAGTATTCCC
AACGAAGTGG CGGTACTCGA TGCCAATAAC GATGGCCTCA CAGACCGTAT CTACGCCACA
GATACCGGCG GCAACGTGTG GCGCATGGAT TTACCGGGGG CCACGCCCAA AGATGCCAAC
AGGCGTTGGT CTGCCTTTAA GTTTGCCTCC CTTGGCGGCA TGACAACAGG CTCCGACAGG
CGATTTTTTG CAGCTCCCGT GGTGGCCCAA ACGGCATTGA ACAATACCCT GGAATACACC
AGCCGTGAGA AAGGTCGCAC CACCACAGTA ACCACAGTCC AGACCATTCC CTATGATGCC
GTGGTGGTGG GCAGTGGTAT TCGTCCCGCC CCTCAGGATG ACCAGCGTGA AGACATGTTC
TTCACCCTGC AGGACCGCAA TATCGGTATA AGGTCCTTCG ATGGCAGTAA TAAAGACAGG
CTGCCGCCAT CGGCCCTGAC CTTGGCGGAT TTGTATGATG TCACCAGCTC GCCTCCAACC
ACCAAAGAAG AAGAGGTGCG TTTTGGTACG CTGCGGGGCT GGTATTACAA TTTCACCCGT
AAAGGCGAGA AGAGCCTGTC TGCGGGCTCC ATAATCAGGG GGCGGGTGTT TTTTACCTCA
TACGTACCCG GCAGCGCCGG TGCTCCCGGT ACCAATCAGT GTCTCATCCC GGGTAAAGGC
TATCTCTATG GCTTTGATTT GCACAAGGGG ACCCGGTCTT ACAATCAAAC CTATCTGGAA
ATGGGCGAAA GTGTGCCGGA TACACCACAG CTTGTGGTGC CAAGCAGCGA AGCCATGTAT
CTGATTGGCA TCGGCAAGGC GCCTGAATTG ATGGTGAAAA CCCGCTGTGA AGATAACAAC
GAACACTGTG ATGGTTGCCC GCCTGGCGAC GAGAAATGTA TCGGTGGTGG CATGAATACC
CGGAAGATTT ATTACTATGC AAATTGA
 
Protein sequence
MACLLWTPLY ADDTSLYVYE SSNRSDERPQ VLVIFDNSGS MDTTVYGVNP SFSSADGDLS 
ESEQVYYSLD AAEAPPNPAN PAEKRFFTFK RNACASSFEF LKEQGVFTGF MRHYVYSGQT
GSWEEFPRSD GASIRMVECF EDIQDKNYDN GSVAKDGLPV DGEGRRGSPS PFFRVSSGSK
EATKELAMSK AKNTGFGTGR VVTLYTKTYL TWYHSKKKQV NRTRIDIAKE AVTNVLLTTP
GVDFGLAIFN SNVYEGYDDG GRIIAGIKPA TASNKKDLIE AVDLLKGTTW TPLCETLYEA
YRYFSGGEVW FGDDDPTLKP YRDKDAIDAN SRYKSPFKTC QNRAYIVYVT DGEPTRDSNA
NQLVYDLTGG VDAYTSSPAS YLSSLSSWMN TQDVNPNMTG KQSVSTYTIG FSQGAASAAG
LLRHTAEKGG GKYYDATNVD DLQKSLMQVF KNILEKNASF TAPAVASNNF NRIRTFDSVY
YSMFLPNRGP RWSGNLKKFK VTDSGDIIDA SKAKVIDGDG NIAKTACSHW SSSADCSAGD
GNDVRRGGAA GMLQRMQARD RNLLSDVGGL KPLTLSAART KAGGDGALAT LLGVEESEVA
SLIDWARGVD VDDDNDNGDR TEMRADVMGD PLHSKPLAIN FGSEGSPDIR VIVGTNHGVL
HMFKDEGTSV SESWAYLPWE MLPKQATLRE NLPSGRHSVY GIDGSPVAWV KNGASGIQKA
WLFFGLRRGG DAYYALDITN PDAPRFMWRI DGNSPGMDLL GQSWSKPVVT FIPGRESSPV
LILGGGYSPS NKDIPGVGTP DNLGTAVFIV DAATGALVHA FGPNNAGNMT VMPGIKDSIP
NEVAVLDANN DGLTDRIYAT DTGGNVWRMD LPGATPKDAN RRWSAFKFAS LGGMTTGSDR
RFFAAPVVAQ TALNNTLEYT SREKGRTTTV TTVQTIPYDA VVVGSGIRPA PQDDQREDMF
FTLQDRNIGI RSFDGSNKDR LPPSALTLAD LYDVTSSPPT TKEEEVRFGT LRGWYYNFTR
KGEKSLSAGS IIRGRVFFTS YVPGSAGAPG TNQCLIPGKG YLYGFDLHKG TRSYNQTYLE
MGESVPDTPQ LVVPSSEAMY LIGIGKAPEL MVKTRCEDNN EHCDGCPPGD EKCIGGGMNT
RKIYYYAN