Gene Sama_1101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_1101 
Symbol 
ID4603353 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp1324974 
End bp1326563 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content57% 
IMG OID639780448 
Productaminopeptidase 
Protein accessionYP_926978 
Protein GI119774238 
COG category[R] General function prediction only 
COG ID[COG2234] Predicted aminopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.252939 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACCCGA CCTCCCGCCT GTTTTTGGGC CTCGGACTGT GCCTGTCTGC CCAGGCCTTT 
GCCGCCCCGC TGACCTTTGA TGAAACCGCA TTCCGCGAAG ACGTGAAGAC CCTTGCCAGT
GACGCCTTTG GCGGCCGCGC GCCCCTCTCC GATGGTGAGC AAAAGACCCT CGATTACCTC
ACCCATGCCT TTAAGTCGAT GGGCCTGAAA GGCGCTTTCA ATGGCGAATA TTTGCAGGCA
GTGCCGATGG CGAAAATCAC CGCCGATCAG AGCATGGTGC TTAAGGTGGG TGAACTCAGC
TTTACCTCGG GTGAGGATTT CACTGCACGT ACCCAGAGGG TGGTACCCAA GGTAGAACTG
AGTGGCAGCG ACATGGTGTT TGTCGGTTAC GGCATCAATG CCCCCGAATA CGGCTGGAAT
GACTACGCAG GTATCGATGT GCGCGGCAAA ACCGTGGTGC TGCTGGTTAA CGACCCGGGC
TTTGCCACCC AGGACCCCAA GGTCTTCAAA GGCAACGCCA TGACCTACTA CGGCCGCTGG
ACCTACAAGT ATGAAGAAGC CGCCCGTCAG GGGGCAGAAG CCGTGTTTAT CGTCCATGAA
GATGCTCCGG CGGCGTACGG CTGGGGCGTG GTGAAAAACT CCAATACCAA TACCAAGTTC
ACTTTGGTTG ATGGCAATAA CAACCAAAGT CAGGTGGGCG TGATGGGCTG GCTGCAATAT
GCGGCGGCCA AGCAGATTCT GGCGGCTTCC GGCCAGGATA TTGAAGCGCT GAAAGCCGCA
GCCAAGGCGC CGGGCTTTAA AGCCGTGCCC TTGACGGTGC AAGCCGATTT GACCCTCAGT
AATCATATCG AGCGCGCCGA GTCCCATAAC GTGGCCGCCA TATTGCCCGG CAACAAAAAT
GCCGATGAAG CTGTGGTGAT GCACGCCCAT TGGGATCACC TTGGCCAAAT CGAGGAAGAG
GGCAAAACCA TCATCCTCAA TGGTGCCGTG GATAACGCCA CCGGCGTGGC CGGGGTACTG
GCGCTGGCAA GACACTATGC TGCCTTGCCA GAGGCAGAAA AGCCCGCCCG CAGCATGATT
TTTTCCGCTT TCACTGCTGA GGAAACCGGC CTGATTGGCG CCCAGTATTT TGCTGAAAAT
CCGCCGTTGC CGACATCTAA GCTGGTGGCT TTTTTAAACA TTGATGGCAT GAATGTGGGC
GAAGGCGTGG ATTACATATT GCGCTACGGT GAAGGGGTCT CTGAGCTGGA AACTATGCTC
AGTGACGCCG CCAAGGCCCA GAACAGACAG GTGAAGGCCG ACCCACGACC TCAAAATGGC
CTGATGTTCC GCTCGGATCA TTTTGCTCTG GCGCAGCAAG GGGTGCCCGG ACTGCTGTTT
ATGAGCCTGG GTGACACCGA CCCTGACTAC ATTGCCCACA AGTACCACAA GGGCGCCGAC
GATTACTCCC CGGACTGGCA ACTTGGTGGT GTAAAGCAGG ACCTTAAATT GATTGAGCAA
ATTCTTTCGC GCCTTGCCAA TGGCAGCGAA TGGCCCAAGT GGCTGGAAGA GTCTGACTTC
AAAGCCCGCC GTGCCAAAGA TGGCCGTTAA
 
Protein sequence
MNPTSRLFLG LGLCLSAQAF AAPLTFDETA FREDVKTLAS DAFGGRAPLS DGEQKTLDYL 
THAFKSMGLK GAFNGEYLQA VPMAKITADQ SMVLKVGELS FTSGEDFTAR TQRVVPKVEL
SGSDMVFVGY GINAPEYGWN DYAGIDVRGK TVVLLVNDPG FATQDPKVFK GNAMTYYGRW
TYKYEEAARQ GAEAVFIVHE DAPAAYGWGV VKNSNTNTKF TLVDGNNNQS QVGVMGWLQY
AAAKQILAAS GQDIEALKAA AKAPGFKAVP LTVQADLTLS NHIERAESHN VAAILPGNKN
ADEAVVMHAH WDHLGQIEEE GKTIILNGAV DNATGVAGVL ALARHYAALP EAEKPARSMI
FSAFTAEETG LIGAQYFAEN PPLPTSKLVA FLNIDGMNVG EGVDYILRYG EGVSELETML
SDAAKAQNRQ VKADPRPQNG LMFRSDHFAL AQQGVPGLLF MSLGDTDPDY IAHKYHKGAD
DYSPDWQLGG VKQDLKLIEQ ILSRLANGSE WPKWLEESDF KARRAKDGR