Gene Sama_0454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_0454 
Symbol 
ID4602709 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp559471 
End bp561156 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content54% 
IMG OID639779790 
ProductMSHA biogenesis protein MshL 
Protein accessionYP_926334 
Protein GI119773594 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1450] Type II secretory pathway, component PulD 
TIGRFAM ID[TIGR02519] pilus (MSHA type) biogenesis protein MshL 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATTT CTGGTATCAA ATACCTTACC CCTCTGCTGT CGCTGTGCCT GATAGCATGC 
CAAACCACTG ACAGGCCCAA CCCTCAGGCG TCCAAAGAGG CGCTCAGGGA GGTTGTGGCT
CAACAAAATG CACAAGCTCA GCCACCGGCC AAATTGCCCG ATTCAGTGTC CCGCGAATTG
GCCGGCAGTA ACACAGTATT TGCGCCAACG CTGCCACCGG AGCGCCGTTT CGATGTGGCT
GCCAATGCGG TGGATGCCAG AGTGTTTTTT CCGAGTTTGG TCAAGGGAAC GCCTTTCAGC
GTGGCCGTAC ACCCCGATGT GCAGGGACGG ATTTCTCTGT CGCTCAAAGG CGTGACCCTG
AGTGAGGCCT TGCAGGTGAT TGAAGACCTG TACGGCTATG AAGTCAGCCA TGAGGGCAAG
GTGCTAAAGG TATTCCCATC GGGCATGCGA ACCGAAACGT TCCCGGTGAA CTACCTCTAT
ATGGAACGCA TGGGGGTGTC TCTAACGTCT GTGACGTCCG GCCGTATTTC CGACAACAAT
AACAACAATA ACAACGGCAA TAACAACAAC GGCAGCAATG GTAATAACGC TTTTGATAAC
GGCGGGGCCA ACAATAACGG TGTCAACGGC AACAATACCA ATGGTAACAA TACCAACGGC
ACCTTTATCC AGTCCCGCAA TAAAACAGAC TTCTGGGGTG AACTGAAAGA AACCCTTGAG
TCCTTGATTG GTGGCACCAG TAACAACCGC AGTGTGGTGG TCACACCCCA GGCCGGCTTG
GTGACTGTGC GTGCGCTGCC GGGTGAGCTG CGTCAGGTGA GGGAGTTTTT GGCCACCGCC
GAAACCCATC TGCAGCGTCA GGTGATTTTG GAGGCCAAGG TGCTTGAGGT GACCCTGTCT
GATGGTTATC AGCAGGGTAT CCAATGGAAC AAGATTGCCG GCAGTGCACT GGCCGATGGC
AACACCAAAA TTAATTTCGC CACTTCAGCA GGCAATGAGT TCGGCAATCA AATTTCCAGT
GCCTTGGGCG GCGTAACCTC GTTGTCCATC ATAGGTTCAG ACTTCGACGC CATGATAAAT
CTCCTCGATA CTCAGGGGGA TGTGGATGTG TTGTCGAGCC CCCGTGTCAC CGCCTCCAAC
AACCAAAAGG CGGTGATCAA GGTGGGCAAG GATGAATACT TTGTCACTGA TGTGTCATCC
ACCACAGTGG CGGGCACCAC ACCTGTGACC AGCCCCGAAG TCGAGCTGAC ACCCTTTTTC
TCCGGTATCG CACTGGATGT TACCCCACAG ATTGACGGCC AGGGCAATGT GCTGCTGCAT
GTGCATCCCT CGGTTATCGA TGTGAAAGAA CAAACCAAGA CCATCAAAAT CAGCAACAGC
GATCTGGAGC TGCCCCTGGC CCAGAGTGAG ATCCGCGAAT CGGATACCGT GATTAAAGCC
ATGTCGGGGG ATGTGGTGGT GATTGGGGGT CTGATGAAGA GTGAGAGCTT GGAGCTGGTG
TCCAAGGTAC CTCTGCTTGG GGATATTCCG TTTCTGGGTG AAGCTTTTAC CAACCGCAGT
CAGTCTGTTC GTAAAACGGA GCTGGTGATA CTGCTCAAAC CAACCGTGGT CGTAAGCGGC
ACCTGGCAGA AAGAGCTGGA GCGGTCAAAG GCCTTGTTGG ACCGTTGGTA TCCCGAGGGC
GAATAA
 
Protein sequence
MAISGIKYLT PLLSLCLIAC QTTDRPNPQA SKEALREVVA QQNAQAQPPA KLPDSVSREL 
AGSNTVFAPT LPPERRFDVA ANAVDARVFF PSLVKGTPFS VAVHPDVQGR ISLSLKGVTL
SEALQVIEDL YGYEVSHEGK VLKVFPSGMR TETFPVNYLY MERMGVSLTS VTSGRISDNN
NNNNNGNNNN GSNGNNAFDN GGANNNGVNG NNTNGNNTNG TFIQSRNKTD FWGELKETLE
SLIGGTSNNR SVVVTPQAGL VTVRALPGEL RQVREFLATA ETHLQRQVIL EAKVLEVTLS
DGYQQGIQWN KIAGSALADG NTKINFATSA GNEFGNQISS ALGGVTSLSI IGSDFDAMIN
LLDTQGDVDV LSSPRVTASN NQKAVIKVGK DEYFVTDVSS TTVAGTTPVT SPEVELTPFF
SGIALDVTPQ IDGQGNVLLH VHPSVIDVKE QTKTIKISNS DLELPLAQSE IRESDTVIKA
MSGDVVVIGG LMKSESLELV SKVPLLGDIP FLGEAFTNRS QSVRKTELVI LLKPTVVVSG
TWQKELERSK ALLDRWYPEG E