Gene Sama_2117 details


Gene Information

Locus tag: Sama_2117
Symbol:
ID: 4604367
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella amazonensis SB2B
Kingdom: Bacteria
Replicon accession: NC_008700
Strand:
Start bp: 2560706
End bp: 2561896
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 56%
IMG OID: 639781502
Product: hypothetical protein
Protein accession: YP_927992
Protein GI: 119775252
COG category:
COG ID:
TIGRFAM ID:
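
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: the function name `cds_lengths` is hypothetical, and it assumes inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes start_bp and end_bp are inclusive coordinates and that the
    final codon is a stop codon, which is not part of the protein.
    """
    gene_len = end_bp - start_bp + 1  # inclusive coordinates
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1   # drop the stop codon
    return gene_len, protein_len


# Values taken from the record above.
gene_len, protein_len = cds_lengths(2560706, 2561896)
print(gene_len, protein_len)  # 1191 396
```

Both results agree with the Gene Length and Protein Length fields of this record.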


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGAATC CAGGATTGCG TTCTGCACTG CTGATGCTGC AACCCAAACT GCCGTTACTT 
TTACTTGTGG CGCTCACGGC CTGTGGTGGT GAAGGTGAGA GCACAGCGCC CACAGCGCCG
CCCTCAATTG TCGTAACGCC GCCTGCCACT TCGGTGTGCG ACAACAATAA TGGCAGCATC
AATCATGAGG CCCTGATGAG CACAAACTGT GCGCGCTTGT CTGACTACCG GCTTTTCAGC
GACAGCCGCA ACCCGGCTCT GGCACCCCAT ACGCCGGGTG TGGCCTACCA GCTCGCCAGT
GAGCTTTTTA CCGACTATGC CATTAAACGC CGCTTTATCT TCTTACCCGA AAACAAGCCC
ATGGTGTTAC AAGGAGATGC CCTGTCGTTG CCGGTGGGCA CCGTGTTGGT TAAAAGCTTC
TTACTGCCTT CAGATACCTC AGACACCAAT GTTTCTGCTG CCCGGCTTAT CGAGACCCGG
CTGCTTATCC ACCGCGAGAG TGGATGGACG GCGCTGCCCT ATCTGTGGAA TGGGGATGAG
GCGTACCTTG CCGAAACCGG TGCCGATGTA TCCCACAGTA TGAATAGCAC AAATGACACG
CTGAACTTCA CCTATCACGT GCCCAGCCGC GCCGAGTGTA AGATTTGCCA TCAAAGCGCC
CGGAATGGCC TGACCACCAT AGCGCCCATA GGTCCCAAAC CGCTGCTGCT GAACAAAGCC
ATCACCGTTA ATGATGAGTC GATTAATCAG CTGACATGGT TTGCATCCCA GGGGCTGCTC
ACAGGGCTTG GCGAGATTGA CTCACTGCCA CAGACCTTTG CCATTGGGGA TGAGCAGCAA
AACCTTACCG CGCGGGTGAA AGGCTATCTC GATGTGAACT GTGCCCATTG CCACAAGGCC
GACGGCTTTG CCAGCGTATC GGGACTCAGG CTCGGGTTCG AAACCGATCA TCACAGCTAT
CAGTACGGAA TTTGTAAGCA GCCGCCCGGT TGGGATGGCG GCGAGCGGGG GCTTTCCTAC
GACATAGTGC CGGGCAACGG CGACCACTCG ATTCTGGTCT ATCGACAGAC GCTTTCAGCG
GCCAAAGACC GCATGCCGCC AGTGGGACGA GCTTTGGTGC ACAGTGAAGC CGTAACGCAG
ATCAGTCGCT GGATAGACTT GATGGCCCCG TCGGTGGGTA ACTGTCAGTA G
 
Protein sequence
MSNPGLRSAL LMLQPKLPLL LLVALTACGG EGESTAPTAP PSIVVTPPAT SVCDNNNGSI 
NHEALMSTNC ARLSDYRLFS DSRNPALAPH TPGVAYQLAS ELFTDYAIKR RFIFLPENKP
MVLQGDALSL PVGTVLVKSF LLPSDTSDTN VSAARLIETR LLIHRESGWT ALPYLWNGDE
AYLAETGADV SHSMNSTNDT LNFTYHVPSR AECKICHQSA RNGLTTIAPI GPKPLLLNKA
ITVNDESINQ LTWFASQGLL TGLGEIDSLP QTFAIGDEQQ NLTARVKGYL DVNCAHCHKA
DGFASVSGLR LGFETDHHSY QYGICKQPPG WDGGERGLSY DIVPGNGDHS ILVYRQTLSA
AKDRMPPVGR ALVHSEAVTQ ISRWIDLMAP SVGNCQ
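
The relationship between the two sequences above can be sketched in a few lines of Python. This is a minimal illustration, not the tool that produced this record: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon assignments are the standard ones, which translation table 11 shares for amino acids. Table 11 also permits alternative start codons, but that is not handled here and does not matter for this gene, which begins with ATG.

```python
# Standard codon assignments, ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGTCGAATC CAGGATTGCG TTCTGCACTG CTGATGCTGC AACCCAAACT GCCGTTACTT"
print(translate(first_line))  # MSNPGLRSALLMLQPKLPLL
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.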