Gene Ssed_1020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsed_1020 
Symbol 
ID5609947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sediminis HAW-EB3 
KingdomBacteria 
Replicon accessionNC_009831 
Strand
Start bp1213148 
End bp1214638 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content51% 
IMG OID640931868 
Productcarboxypeptidase Taq 
Protein accessionYP_001472759 
Protein GI157374159 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2317] Zn-dependent carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.16508 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAGC CCAACCTAAG CCCCCATTAC GACAGGCTCA CAAAACATTT TCAGACAATC 
TCTCATTTCG AACACCTGAG TGCTCTGGGT GACTGGGATC AGGCGACGAT GATGCCAGTC
GGCGGAGGCG CCGCCCGCGG CGCTGCGATG GCTGAGTTAG CCAAACATAT TCACGAGCTA
AAAACAGCCC CTTTCCTTGC CGACACACTA CAACTTGCTC AAGATGAGAT GCTAACTCGC
GAGCAAGCTG CGAACCTGAA AGAGATGAAT TATCAATTTT TTCAGGCCAA CGTGATCCCC
GCATCACTGG TTAAGGCTAA GACTGAGCTC GCCTACCGAT GTGAACATGC CTGGCGTGAT
CAGCGCAAGA ATAATGACTG GCAGGGGTTC AGACCTAATC TCGAAGCCTT GATGGCACTC
GTTAAAGAGG AGGCGAATAT CCGTTCACAG GCACAAGGCT TATCACCTTA TGATGCCCTA
CTGAACAAAT TTGAACCAGG CATGACGACC GAGCGTCTCG AATCGGTATT TGGTCACCTT
AAAACCTGGC TGCCCTCGCT TATTCAGCGA GTTCAGCATG AACAGGCCAA AGAGCACAGG
TTTAATATCG AATCCTGCGG CAGTCAGGCT CAGGAGACAC TGGGACGAGA GGTGATGGAC
TTTCTCGGAT TTGATTTTAC TCAGGGTCGA TTAGATGTCA GTAGCCATCC CTTTTGTGGT
GGGGTGCCCG GTGATGTTCG CCTGACGACC CGCTACGATG AGTCCGATTT CACCAGCGCC
TTAATGGGGG TTATCCATGA GACGGGCCAT GCCAGATATG AGCAGGGGTT ACCGGTTAAC
TGGCGAGGAC AGCCTGCCGG CCATGCTCGC TCGATGGCTA TCCATGAGAG CCAGAGTCTG
TTCTGTGAAA TGCAACTGGG ACGCGGCAGC GGATTCCTCT CCCATTTACA ACCCAAAATA
GCCAAACACT TAGGTAGCCA ACTTTCAACG GAGCAACTGA CCAATATCTA CACCCGGGTT
AATCCTGGTC TTATCAGGGT CGATGCCGAT GAGATCACCT ACCCTTGTCA TGTCCTACTC
AGATTCGAAG CCGAGAAAGG CTTAATCGAT GGCAGTCTCA GTGTCGCCGA TCTGCCAGAA
TTCTGGGCCC AGCAGATGAG TTCGTTATTA GGCGTTAACA CCCAGGGCAA CTTTAAAGAT
GGTTGCATGC AAGATATACA CTGGGCCGTG GGCGAACTTG GATACTTCCC CAGTTACACC
TTAGGCGCTA TGTATGCGGC TCAATTTCGT TTTGCCATGG AGGCGAGCTT AGGCTCGGTG
GACACCTTGG TTGCCCAGGG AAATATCGCT CAAATATTTG AGTGGCTGGA ACAGAAAATT
TGGTCACAGG GAAGCCTGTT AAATACAGAC GAACTGGTCA AACAGGCCAC AGGCGAAACT
CTGAACCCCG ATTATTTCAA ACGACACCTG GAGCAAAGGT ATCTGAAATA A
 
Protein sequence
MTQPNLSPHY DRLTKHFQTI SHFEHLSALG DWDQATMMPV GGGAARGAAM AELAKHIHEL 
KTAPFLADTL QLAQDEMLTR EQAANLKEMN YQFFQANVIP ASLVKAKTEL AYRCEHAWRD
QRKNNDWQGF RPNLEALMAL VKEEANIRSQ AQGLSPYDAL LNKFEPGMTT ERLESVFGHL
KTWLPSLIQR VQHEQAKEHR FNIESCGSQA QETLGREVMD FLGFDFTQGR LDVSSHPFCG
GVPGDVRLTT RYDESDFTSA LMGVIHETGH ARYEQGLPVN WRGQPAGHAR SMAIHESQSL
FCEMQLGRGS GFLSHLQPKI AKHLGSQLST EQLTNIYTRV NPGLIRVDAD EITYPCHVLL
RFEAEKGLID GSLSVADLPE FWAQQMSSLL GVNTQGNFKD GCMQDIHWAV GELGYFPSYT
LGAMYAAQFR FAMEASLGSV DTLVAQGNIA QIFEWLEQKI WSQGSLLNTD ELVKQATGET
LNPDYFKRHL EQRYLK