Gene SO_1109 details

Gene Information

Locus tag: SO_1109
Symbol: apbE
ID: 1168942
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella oneidensis MR-1
Kingdom: Bacteria
Replicon accession: NC_004347
Strand:
Start bp: 1152199
End bp: 1153242
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 44%
IMG OID: 637343060
Product: thiamin biosynthesis lipoprotein ApbE
Protein accession: NP_716735
Protein GI: 24372693
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis
TIGRFAM ID:
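
The coordinate and length fields above are mutually consistent, which a short check can make explicit. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1152199, 1153242)
print(length)                     # 1044
print(protein_length_aa(length))  # 347
```

Both values match the Gene Length and Protein Length fields listed in this record.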


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTAAAA CCCTAGCGTT AAAAACCTCA ATGTTTAAGC ATTCAGTTAG CTGGCTGGTC 
CTTGTAGGTC TGGCCTTTTT TATTTCAGCC TGTAGCAAAC AGGATGAGGT TATTTCCCTT
GCGGGCAGCA CAATGGGAAC GACTTACCAC ATTAAAGTGG TTCCCAATGA GCATATGCCG
ACAGCCCAAC TATTGCAGGC TGAAATTGAC TTAGCCCTAG AACAAGTCAA TAACCAAATG
TCGACCTATC GTCCTAATTC CGAGCTTTCC CGTTTCAATC AATTACCGCT AGAGCAGAGT
GTGGAAGTCT CTCCCGATAC CATTAAGGTA GTTAGAGAAG GTATGCGTTT ATATGACGTT
ACCGATAAAG CGTTAGATAT TACCTTAGGT CCTTTAGTTA ATCTGTGGGG TTTTGGACCT
GATAAGCGCC CGACAAAAGT ACCGAGTCAA GCCGAGATTG ATGCAGCTAA GGCCAAAACG
GGTATTCGTG AACTGTCTAT CGAAGGCAAT CTTTTACGTA AACATAATGC ACACTTGTAT
GTGGATTTAT CGTCAATCGC TAAGGGCTTT GGTGTTGATA AAGTTGCTTC GATTTTAGAT
AAGTATCAAG CAACGGGTTA CTTAGTTGAA ATCGGTGGCG AGCTGAGCAT TAAAGGCACT
AAAGGCGATG CTAGCTCATG GCGTGTGGCA ATAGAGAAGC CGACCGATGA AGGTATAGCT
GTGCAGCAGG TGATTGAACC TGGCACTATG TCTATGGCAA CGTCGGGAGA TTATCGCAAT
TATTATGAGG AAGCAGGCCA ACGTTTTACG CATATAATTG ATCCGCGTAC CGGTTTGCCT
ATCAATCATA AGCTAGCATC TGTCACCGTT TTGCATAACG AATGCATGAC GGCAGATGGT
TTTGCGACGG CAATGATGGT TTTGGGTACA GAAGCATCAT TGGAGCTTGC CAAGAAGGAA
CACTTGGCGA TAATGCTAAT AGAAAAGCAA GGTGAAGGAT TTAAAGTCTA CTACAGCGAC
GCCTTCAAGC CTTTCCTTAA GTAG
 
Protein sequence
MFKTLALKTS MFKHSVSWLV LVGLAFFISA CSKQDEVISL AGSTMGTTYH IKVVPNEHMP 
TAQLLQAEID LALEQVNNQM STYRPNSELS RFNQLPLEQS VEVSPDTIKV VREGMRLYDV
TDKALDITLG PLVNLWGFGP DKRPTKVPSQ AEIDAAKAKT GIRELSIEGN LLRKHNAHLY
VDLSSIAKGF GVDKVASILD KYQATGYLVE IGGELSIKGT KGDASSWRVA IEKPTDEGIA
VQQVIEPGTM SMATSGDYRN YYEEAGQRFT HIIDPRTGLP INHKLASVTV LHNECMTADG
FATAMMVLGT EASLELAKKE HLAIMLIEKQ GEGFKVYYSD AFKPFLK
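
The sequences in this record are printed in space-separated blocks of ten, so the whitespace has to be removed before any computation. The following is a minimal sketch of recomputing a GC percentage from such a block-formatted sequence. It assumes GC content is the fraction of G and C bases over the full gene sequence, which may differ from how the database derived its rounded figure. The helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from this record.
first_row = "ATGTTTAAAA CCCTAGCGTT AAAAACCTCA ATGTTTAAGC ATTCAGTTAG CTGGCTGGTC"
print(len(clean_sequence(first_row)))  # 60
print(round(gc_percent(first_row), 1))
```

Passing the complete gene sequence instead of a single row gives a value that can be compared with the GC content field under Gene Information.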