Gene Ssed_0039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsed_0039 
Symbol 
ID5613695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sediminis HAW-EB3 
KingdomBacteria 
Replicon accessionNC_009831 
Strand
Start bp45367 
End bp47139 
Gene Length1773 bp 
Protein Length590 aa 
Translation table11 
GC content45% 
IMG OID640930862 
Productcollagenase 
Protein accessionYP_001471780 
Protein GI157373180 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000449009 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0513701 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTCGCC GTTCTCTTAT TGCCATCACG CTTATCTCTC CCACACTCAT GACATTAAAT 
GGGTGTAGCA GCTCTCCTAA CTTAATAGTA GACCAGACAC TCATCAGCCA GATTAATCTG
GCGAATAAAG ATTCATTTTT TGATGCCAGC AACCCGCAGT TAAACAAAAA TGCATTTGAA
CAGATAAGTC TTGCCTTAGC GACAACAAGC GATCAGCAAC AAGCCGATAA CCTTCTTTAT
TATCTACGCG CGTTTAGCTA TTTTGGGCCC ATCGATGAGC TTGATGACAC AAGCTACGAA
TCACTCACCT TCGGGCTTAA ACAACTCGCT AATAATGAGT TGTTGAGTCA CTCCTCTCGG
CTACAGGAAC AATATGCTGT TACCCTGTAT CGTTACTACG CTAACAGTGA ACGTGCAACA
CAACTTGCCA CCCTCTTACC TCAACTTGAC TCACAGTTAT CCTCACTAGG GCAGAGCGCT
TCCAATACGG ATAATGACTA CGCACTTTGG GAGACGCTCA GAGCCTATGG TCTGTTATTT
AATACAGCCA GAAAAGAAGC GGATGGGGAT CTTAACAAAT TATTGGTAGA ACAGGATTTG
GCGCGTCCGC TACTGGAGTT TGCAGCATCA ACGACAAGCA TAAGATCCGA TAATGACTGG
CCTAAGGCAA ATGCTTATTG GGCTTTAGGG CTGTACCGTT TAGCACTACC GGCGAGTGAG
AATGATGAGC CTACCCCTTC AGAGCAGAAG ATAGACGATG CGGTTGCCGA TATAGCCAAG
CAAGACATTA AGCTACGCGG TGACAAAGCC AAAGATACCT ATACCTTGGG CTATCATGTC
AATGCTTTTG CGGGTAAAGA GGCATGCGAA AACAACAGTG AGCTCTGCCG CATACCAGAG
TTAGAAGATG TTCTGCCTAT CAATCATTCC TGCTCTGACA GTTTATTTAT TCTCGCACAA
GACTTAAATG AGCAAGAGCT TGCGACAAGC TGCACTAAAC TCACCTCGCA AGAGGCCAAC
TTTCATCAAG TTCTGGAAAC CCGATATCAG CCTACCGCCA ATGACTTCAA CAATGCACTG
CGTGTTGTGG CGTTTAAAAA TTGGAGCCAG TACAACGCAT ACGGACAACT ACTTTTCGAC
ATAGATACTG ATAATGGTGG CATGTATATC GAGGGTACTC CGTCAAAGCC TGGTAACCAG
GCAACATTCT TCGCCTACCG ACAGTTCTGG ATCGAGCCAG AATTTGCTAT CTGGAATCTA
AACCACGAAT ACGTACATTA CCTCGACGGT CATTTCGTCA AATATGGTGG CTTCGGGCAT
TTCCCGGAGA AGATGGTGTG GTGGTCGGAA GGCTTGGCCG AATACATATC CAAGGGCAAC
GACAATCCAA ATACGTTAAA AGTGATCAAG AAAGATATCG ATAAGGCCCC CAGCCTTGAA
GAGATCTTTG CCACTGAGTA TAAAGACGGG CAAGACAGAA CATATAAGTG GAGTTATATG
GCAGTGCGCT TTCTGGTTGA AAACCACCAT TCAGATTTCG TTCAACTGAG TCATTATTTA
AAAACGGATT ACTTTGAAGG TTACGCTGAG TTGATGGCTG AGCTCACTGA CCATCAGGCG
CAATTTTCAG ATTGGCTAAA TGTACAGGTG GAACAGTTTG ATGATAGCGA AGAGAAAGCT
AAGCCCAGAC TCAATAAGCA AAATAGATAT AGCTACCGGG ATTACCTGCG ACCCGCTCAC
TTGGTAAAAG ATGATGCTCA CAGACATTAT TAG
 
Protein sequence
MFRRSLIAIT LISPTLMTLN GCSSSPNLIV DQTLISQINL ANKDSFFDAS NPQLNKNAFE 
QISLALATTS DQQQADNLLY YLRAFSYFGP IDELDDTSYE SLTFGLKQLA NNELLSHSSR
LQEQYAVTLY RYYANSERAT QLATLLPQLD SQLSSLGQSA SNTDNDYALW ETLRAYGLLF
NTARKEADGD LNKLLVEQDL ARPLLEFAAS TTSIRSDNDW PKANAYWALG LYRLALPASE
NDEPTPSEQK IDDAVADIAK QDIKLRGDKA KDTYTLGYHV NAFAGKEACE NNSELCRIPE
LEDVLPINHS CSDSLFILAQ DLNEQELATS CTKLTSQEAN FHQVLETRYQ PTANDFNNAL
RVVAFKNWSQ YNAYGQLLFD IDTDNGGMYI EGTPSKPGNQ ATFFAYRQFW IEPEFAIWNL
NHEYVHYLDG HFVKYGGFGH FPEKMVWWSE GLAEYISKGN DNPNTLKVIK KDIDKAPSLE
EIFATEYKDG QDRTYKWSYM AVRFLVENHH SDFVQLSHYL KTDYFEGYAE LMAELTDHQA
QFSDWLNVQV EQFDDSEEKA KPRLNKQNRY SYRDYLRPAH LVKDDAHRHY