Gene Sden_3685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSden_3685 
Symbol 
ID4020242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella denitrificans OS217 
KingdomBacteria 
Replicon accessionNC_007954 
Strand
Start bp4417856 
End bp4419337 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content53% 
IMG OID637957744 
Producttriple helix repeat-containing collagen 
Protein accessionYP_564681 
Protein GI91795030 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGGA TTTTTCACTC GTTAACCACA GTCGCCCTCT TGGGCTTAAG TTCAGCATCG 
GCCGTGGCAT CAAACTGGGA TGCCCATTGG TCCTTTGCCG GCCATGGTAA TGACAACTTT
AATGGTGCCC ATCATGGTGC CTTCCCGTTA AGCGGCACTG GCAGTTTCGA ATTCGGCAAG
CTTAATCAGG CCCTTTATTA TGACATTCTT AAAGTGGACT CCTTGAGCAT TCAAAAAGCG
GCTGAGCCTT TCTCTTTGTC CTTCTGGGCC TTGAGAGAAA CCCGTGATTA TGCCGAGACC
TTATTCTCTA AACAGCAGTC GCCATTCAGC GCCGGTATGT CGGTCAGCCT CGATGGTGCC
AATGCCATAG TGGTGGATAT TCGTAATGGT CAAGGCGGCA GTATCAAGAT AAAGAGTCAG
CAAGTCTGGA CCGATATGAA TGATTGGCAT CATTTAGTCG TGACCTACAA TGGCTCAATT
CAGGCAGGTG GCATTAGTTT GTACCTTGAT AATCAACGGC TCGATGTCGA TGTTATCAGT
GACAACTTAA CCGGAGACAT AGGGGCAAGT GACCCTGTCG TTATCGGTGC TGACAGCGAA
ACTAGCTACG CCACCTTCAA CGGTGCCATC GATGAAGTCT ATTTAGGCAG CCGCGCCTTT
AATAGCAGTG ATATCGAGTG TCTGTATGCA CTGAGAGACA ACTGTGTCGA TCCCATTAGT
GACGAGCCTC CAGTGATTGC CCCTCAAGGG CCACGGGGCT TTGAAGGCCC CGCTGGGCCA
CAGGGTCTAC GGGGGGCAAC AGGCTCGCAA GGTGTTAAGG GAGCGAAAGG CCCAATTGGC
GACCCAGGCG TCAAAGGCCC GCAAGGCTCG CAAGGCAGCA AAGGCGATCT GGGCTTGAAG
GGCCTAACAG GCATCGCGGG TAATCCAGGT GTTGATGGCC GTAATGGCCG TGATGGTAAA
GATGGCAACG CGGGCTTAAT GGGCTTACAA GGCCCAGATG GCCTGCAAGG CCCTAAAGGA
GTCCGGGGGG ATACTGGCCC GATGGGACCA CAGGGCGATC CTGGACCTCA AGGCTTAAAA
GGGGCGACTG GCGCCAAGGG CGTCACTGGA GCGACTGGCG ATGCTGGCCC ACAGGGCTAT
GCCGGTGCAC CTGGCATTCC TGGTGCTGAT GGTGTTCAAG GTAACCCAGG TTTACCAGGT
TTGAAAGGGA ACACTGGATC TTCTGGGCCC TCAGGCGCTC CTGGCGGCAA AGGCCCCAAA
GGGCTAAAAG GCCCTCATGG GGATGTTATT CCTGGTGATA ATGCCCCTGC ATTTGGTCCA
CAGGGGCCAA GAGGCCCAGA TGGCTACAAT ACTTATTACA GCACTGGTGG ACGTAATTTT
AGCATGGGAA CAGTACTCTC TACGGAAAAA GCCCTGAGTC CGGTTGAACG ATATAAGGCG
ATACAGGCAA GTAAAGCTAT CCAGGATGGA GACATAAAAT GA
 
Protein sequence
MKRIFHSLTT VALLGLSSAS AVASNWDAHW SFAGHGNDNF NGAHHGAFPL SGTGSFEFGK 
LNQALYYDIL KVDSLSIQKA AEPFSLSFWA LRETRDYAET LFSKQQSPFS AGMSVSLDGA
NAIVVDIRNG QGGSIKIKSQ QVWTDMNDWH HLVVTYNGSI QAGGISLYLD NQRLDVDVIS
DNLTGDIGAS DPVVIGADSE TSYATFNGAI DEVYLGSRAF NSSDIECLYA LRDNCVDPIS
DEPPVIAPQG PRGFEGPAGP QGLRGATGSQ GVKGAKGPIG DPGVKGPQGS QGSKGDLGLK
GLTGIAGNPG VDGRNGRDGK DGNAGLMGLQ GPDGLQGPKG VRGDTGPMGP QGDPGPQGLK
GATGAKGVTG ATGDAGPQGY AGAPGIPGAD GVQGNPGLPG LKGNTGSSGP SGAPGGKGPK
GLKGPHGDVI PGDNAPAFGP QGPRGPDGYN TYYSTGGRNF SMGTVLSTEK ALSPVERYKA
IQASKAIQDG DIK