Gene Sama_1597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_1597 
Symbol 
ID4603849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp1948886 
End bp1952014 
Gene Length3129 bp 
Protein Length1042 aa 
Translation table11 
GC content52% 
IMG OID639780953 
Producthypothetical protein 
Protein accessionYP_927474 
Protein GI119774734 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.229285 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTACTCC CCATTTTAAT GACGCTGTGC CCACAGTTGG CATCAGTGAA GCGTCAGCTC 
AACATGATTT TATTGCCAGG CACCCTTTGC CTGATTGCGC TGCTGAGTTG TTATGCCAGC
GCCTTACAGG CTGAAGAATT ACATCCGTCA CTGGCGCGGA TTAGTCTCGA CAGCGAATGG
CGCAGCGGCG CGTATCAAAG CCAGGGTGTC GATTATACCC CCGCCGCATC CTGGGTTGAT
CCACTGGACC TCATGGTTCC AGAGACCATG CCTGACGATC AAATCCGTGA CGGTCTCTAT
AATCTGGTGG TCGATAACCA ATACAAGGTG GATGCCGACG GTAAGAAAGT GCAGTTCAGC
CATTATGCCG ATGTGGTGAC TGCACCCAAG GGACTTGAGT CCGTATCGCA AATTCAGATT
GAATTTGACC CCAACTATCA GCGGCTACAA CTGCACTCTA TCTGGGTGCA CAGGGCTGGT
GAGCTTATCG ATAAGAGTCT GGATGCCGAC TTTCAGCTTG CCCGCAGTCA AAACACAGAC
GATCTGATGT ACGACGGCAG TATGGTTGCG ACCTGGGTTC TTAAAGACAT TCGGGTTGGC
GACATACTGG AATACAGCTA CAGTCGGCTC GGCAGCAACC CGGTATTTGA AGGGCGCTTT
TTCGGTGCCC GCAGTTTGCA ATGGAGTGTA CCTGTTGCCC GTCAGCAGCT GCGCATACTG
TGGGGCAAAC CTAAGCCGCT GAACATTAAA GTGGAAAACA GCGAACTGCA GTTTACAAGT
ACGCGGCTTG GTACTCAGAC AGACTACCGC TTACAAGTGG AACATATGCC TACCATCACT
GCGTCGAGTG ATGCAGCCCC CTGGTATAAC CCCTTTGCCA AAGCCTATTT CTCTGAAACG
AATTCCTGGA AGCAGGTAGT AGACTGGGGC TTGCCTCTCT TTAGCCGTCA GCAGCAAGTG
AATGAAGACA TCACGATGCT GGCCAGACAG ATAACGGCCA AACACGACTC GAAAGCTGAA
CAACTGGTCG CCGCACTGTC CCTGGTGCAA GAAAATGTGC GCTATCTGGG GCTGGAGATG
GGTAGCAACA GCCACTTCCC CTCGGCCGCC GGTGACACCC TCAGCCGCCG TTATGGCGAC
TGTAAAGATA AGGCAGTGCT GTTGGTCACT CTGCTTGGAG AGCTCGGCGT TGAGGCATCT
CCTGCACTTG TGAATACCGA CACGGGCAAA ATGTTGCCTG CCATGCTGCC ATCAGCAGAC
CGCTTTAACC ACGCAATTGT GCGTGCCAGT CTTAATGGCA ATAACTATTG GCTGGATCCG
ACATTGATGG ATCAGGGTGA TGCTCTGGAA CACTTATATC AGCCGGATTA TGGTTTTGCC
CTGGTGCTGA GCGAAGACAG CCAAGAATTG GTTTCCATGG CGTCTGGTGC CAGTGGCAGT
CGCATCCGTT TTACTGAAGA GTATGATTTC TCTGAAGGCA TAAGCGGGGT GGGGAAGGCT
AACATCAGTA CCTTATATCA GGGGGACGAA GCACGGCGGA TGCGTGGTCG CATTGCTTCA
GAAGGATTAA AGGCACTGTC GGAACAATAT GCCGGGTACT ATGGCAAGCA ATTTAAGGGA
CTTGAAGCGA TAAGGCCCCT CGATGTGTTG GACGACAAGG CCACCGGTGA GGTGCGCCTG
GAGGAGGCCT ATCATTTGCC TGCACCCTGG CAGGCCGATG ACGATGGCGG CTACACGGTG
TATGTGTATG AATCGCAAAT CTCCCCCTAT CTGACCTTGC CTCAGGGACA TGGTCGCAGG
GAGTTGGCCC TGAAACATCC TCTGCAAATT GAAGGAACAA TCCGACTTAA GTTTCGTGCC
AATGACTGGC AGTTTGATGC CGCTGTGAGG GAAGAATCCA ATCCCTTCTT CGACTTCCAC
TACAGCGTGG CCTTTGATAA AGAGTCCAAC CTCCTGACCC TGGCGTATCG TTATCAAAGC
AATACCGACC GGGTACCGGC AGACAGAGTT GATGAGTATA TTGCCGCCCT GAAGAAAGTC
GCCGATACCG ACAGTTACGG CATCATCGAT TACAACAGTG AAGTAAAAGA TGCGGTGACT
CAGGAAGCCG ATGCCAGCGA TGATGATACG AATATGGCGT TGATTGTCCT CAGTGCGGGC
TTTGTTGCGC TTTGTGCCAT AGGTTTTGCC ATGGCAAGTT GGCTGATTGA CGATGGCCAT
AAAGCAAGCA GCCGTTACTA TCCGGTATCA CCACTCAAAG CTTTGCTGCT GTCACTGCTT
ACTTTTGGCC TTTACCCCTG CTATTGGGCC TACAAGAACT GGCAATATGT GCAGAACGAG
CTTGAGCCTG GCATCTGGCC CATGGCCAGG GCTATCTTTG CGCCGCTTTG GTATTACCAC
CTATACCTGT CGTTGGCGAA TAGCGGCGAT GAAGACAAAC GACGCTGGCT CATCCCTGGT
TGGCTGGCAG TCCTTTTTAT GATCCTTTAT ATCGGTGTTT CTGCAACCAA AAAGCTTCAC
GGCATGACAG TGTTATCACT GCTTGTTCCG GCTCTGGTCA TTTTACCTCT AATCACCTAC
ATCAATAGGC TCAATGGCCA TAACGAGGCC TATGTTTACA ATAGTCGCTG GCGCTGGCGC
CATTGGCTGC TGACAGTGGC AACACTGCCA CTTTTGCTTT TCGTGTTGGC ACAGGAGGCA
GGGTTCACCG CCTCTGACAG GGTTGTCGCA GGCAGCGATT TATGGCAGCG GGACATCAAG
TTTATGAAGC GCAAAGCCAT CATTGCCGCC AACGAGAACC CGGTGCTCTT CTATTCGGAT
GATTGGCTAA GCAACCAGAG TGATGGTAAT GGCTTTACCG ATGCCAGGAT ATTCTCTTAC
TGGCGTGAAG ACGGTGTCTT TTACCAGGAA ACAGTGCCAT TTGAAGCGGT AAAAAATATC
GAGGTGAACT TTGCCAAAGG TGACGAATTA ACGACTACCA TCACTGTATA TCGGGATGAT
GGCAGTCACT TTTTACTGTT TGCGGCCACG GAGGCCAGAA AGGATAGGCA GTTTGTGCGT
GAACTTCTGG AGCGTTGGCG CAGGATAAGG CCCCAGGGTT CAGATAACAA CGGAGAGCAG
GAACAATGA
 
Protein sequence
MLLPILMTLC PQLASVKRQL NMILLPGTLC LIALLSCYAS ALQAEELHPS LARISLDSEW 
RSGAYQSQGV DYTPAASWVD PLDLMVPETM PDDQIRDGLY NLVVDNQYKV DADGKKVQFS
HYADVVTAPK GLESVSQIQI EFDPNYQRLQ LHSIWVHRAG ELIDKSLDAD FQLARSQNTD
DLMYDGSMVA TWVLKDIRVG DILEYSYSRL GSNPVFEGRF FGARSLQWSV PVARQQLRIL
WGKPKPLNIK VENSELQFTS TRLGTQTDYR LQVEHMPTIT ASSDAAPWYN PFAKAYFSET
NSWKQVVDWG LPLFSRQQQV NEDITMLARQ ITAKHDSKAE QLVAALSLVQ ENVRYLGLEM
GSNSHFPSAA GDTLSRRYGD CKDKAVLLVT LLGELGVEAS PALVNTDTGK MLPAMLPSAD
RFNHAIVRAS LNGNNYWLDP TLMDQGDALE HLYQPDYGFA LVLSEDSQEL VSMASGASGS
RIRFTEEYDF SEGISGVGKA NISTLYQGDE ARRMRGRIAS EGLKALSEQY AGYYGKQFKG
LEAIRPLDVL DDKATGEVRL EEAYHLPAPW QADDDGGYTV YVYESQISPY LTLPQGHGRR
ELALKHPLQI EGTIRLKFRA NDWQFDAAVR EESNPFFDFH YSVAFDKESN LLTLAYRYQS
NTDRVPADRV DEYIAALKKV ADTDSYGIID YNSEVKDAVT QEADASDDDT NMALIVLSAG
FVALCAIGFA MASWLIDDGH KASSRYYPVS PLKALLLSLL TFGLYPCYWA YKNWQYVQNE
LEPGIWPMAR AIFAPLWYYH LYLSLANSGD EDKRRWLIPG WLAVLFMILY IGVSATKKLH
GMTVLSLLVP ALVILPLITY INRLNGHNEA YVYNSRWRWR HWLLTVATLP LLLFVLAQEA
GFTASDRVVA GSDLWQRDIK FMKRKAIIAA NENPVLFYSD DWLSNQSDGN GFTDARIFSY
WREDGVFYQE TVPFEAVKNI EVNFAKGDEL TTTITVYRDD GSHFLLFAAT EARKDRQFVR
ELLERWRRIR PQGSDNNGEQ EQ