Gene Sama_2533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_2533 
Symbol 
ID4604780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp3042011 
End bp3043036 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content52% 
IMG OID639781928 
Productthiamin biosynthesis lipoprotein ApbE 
Protein accessionYP_928405 
Protein GI119775665 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.903036 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA CCTTATCTAA CTGGCTGGTC CTCGTCGGAC TGGCCTTTTT TATTTCAGCT 
TGTTCTAAAG CACCTGAGGT TGTCAGCCTG TCTGGCAGCA CCATGGGCAC CACGTATCAC
ATCAAGGTGG TTCCCAGTGA CGCCTTACCC CAAAGCCAGC TGCTGCAGGC AGAAATTGAC
CTGGCGTTGG AGCGCGTGAA TGACCAGATG TCTACTTACC GTCCCACCTC TGAGCTGAGC
CGCTTCAATC AGTTGCCATT GGAGCAGGGT GTTGAGGTGT CGGATGACAC TATCAAGGTG
GTCCGAGAGG GAATTCGCTT AAATGAGCTA ACCGACGGCG CGCTGGATAT TACTCTGGGC
CCACTGGTCA ATATCTGGGG GTTTGGACCG GACAAGCGAC CAACTAAATC CCCCACCGAA
GCTGAAATCG CCGACGCCAA GACTCGCACC GGTATTCAGA ACATCAGCAT TGAAGGTAAC
CGCCTCTTTA AGCGCAACGC TCACCTTTAT GTGGATTTGT CATCCATTGC CAAGGGCTAT
GGCGTGGATG TGATTGCCGA TTTACTGGAT AAGTATCACA CCAGCGGTTA TCTGGTAGAA
ATTGGCGGTG AACTGCGTAT CAAGGGCACC AAGGGTGATG GTAGCAGCTG GCGTGTAGCC
GTTGAAAAGC CACAGGCTGA AGGCCGTGCG GTGTCTCAGG TAATAGAACC CGGTGATATG
GGTATGGCTA CCTCTGGCGA TTATCGCAAT TACTTCGAAG AAAATGGCAA ACGCTTCTCA
CATCTGATAG ACCCAAGGAC CGGTTATCCC ATCGAGCACA CCCTCGCATC TGTGACAGTG
CTGCATCCAA GCTGTATGAC TGCCGACGGC CTGGCGACGG CCATGATGGT GCTCGGCACG
GAAGCGTCAT TGATTCTTGC CAAACAGCAG GGACTGGCGA TAATGCTGAT TGAAAAACAG
GGCGAAGAGT TTGTGGTACA CTACAGCGAC GCATTTTTGC CCTTCGTTAA GTCCACTCAG
GAGTGA
 
Protein sequence
MKKTLSNWLV LVGLAFFISA CSKAPEVVSL SGSTMGTTYH IKVVPSDALP QSQLLQAEID 
LALERVNDQM STYRPTSELS RFNQLPLEQG VEVSDDTIKV VREGIRLNEL TDGALDITLG
PLVNIWGFGP DKRPTKSPTE AEIADAKTRT GIQNISIEGN RLFKRNAHLY VDLSSIAKGY
GVDVIADLLD KYHTSGYLVE IGGELRIKGT KGDGSSWRVA VEKPQAEGRA VSQVIEPGDM
GMATSGDYRN YFEENGKRFS HLIDPRTGYP IEHTLASVTV LHPSCMTADG LATAMMVLGT
EASLILAKQQ GLAIMLIEKQ GEEFVVHYSD AFLPFVKSTQ E