Gene Sde_3535 details


Gene Information

Locus tag: Sde_3535
Symbol:
ID: 3966377
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 4490789
End bp: 4492105
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 52%
IMG OID: 637922632
Product: hypothetical protein
Protein accession: YP_529002
Protein GI: 90023175
COG category: [S] Function unknown
COG ID: [COG1262] Uncharacterized conserved protein
TIGRFAM ID: [TIGR03440] conserved hypothetical protein TIGR03440
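
The start and end coordinates, gene length, and protein length in this record are related by simple arithmetic. The sketch below checks that relationship; the variable names are illustrative, and it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
# Values copied from the record; the variable names are illustrative.
start_bp = 4490789
end_bp = 4492105
protein_length_aa = 438

# Assumption: coordinates are 1-based and inclusive, so both end
# positions count toward the length.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases. Assumption: the last codon
# is a stop codon and does not contribute a residue to the protein.
total_codons, remainder = divmod(gene_length_bp, 3)
coding_codons = total_codons - 1

print("gene length (bp):", gene_length_bp)
print("leftover bases:", remainder)
print("matches protein length:", coding_codons == protein_length_aa)
```

Under these assumptions the computed gene length agrees with the listed 1317 bp, and the number of coding codons agrees with the listed 438 aa.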


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACCA AGCAGGACAA AAGCGACACA TCGGCACATG TTCACGCGGC ACATGCTCAC 
GCGATATTGT TAGAGCGCCT GTTAGCGGTG CGCAAGCGCA CAGAACAATT GTGCCAAACA
CTCTCAGATG CCGACGCCAC GGCGCAATCT ATGGATGATG CCAGCCCCGC CAAATGGCAT
TTGGCACACA CCAGCTGGTT CTTCGAAGAG TTTCTTTTTG CCCCCGCCCT TGGCGAACAA
GCGCGCTTTC ATCCGCGCTA TAGCTACTTA TTTAATTCCT ATTACGATGG CGTAGGCAAT
CGCCACGCGC GCCCACAGCG CGGTTTACTC ACGCGGCCTA GCCTCGCCGA AATATTAGAA
TATCGCGCCC ATGTAGATAA CTTATTCGCC ACACTCTTTG CCAGCTTAAC TCCACAGCAA
TTTTCTCTTA TAGAGCTAGG CATAGCCCAC GAGCAACAAC ACCAAGAATT ATTACTAACC
GATATTCTGC ATTTGTTTTC ACACAACCCT TTGGCACCTG CGCTACAGCC AAAGGTAGCA
GCACCACTAG CCGCAATTGC CAGCAACACT ACCCCGCCAT TAAATTGGGT GACATTTGAA
GGGGGTTTAA TAGAGATAGG GGCAACCGCT ACGCATTTTT GTTTTGATTG CGAACTACCC
AAACACAAGC AATACCTAAC GCCATTCGCC TTGGCGAGCC GCGCAGTTAC CAATAGGGAG
TGGCTGGCTT TTATTAATGA TGGCGGTTAC AGCAACCCAC TTGTTTGGCT CTCGGACGGC
TACGCCACTG CCGTTAAAAA TAATTGGCAA GCCCCGCTGT ATTGGCAGCA ACAGGCTGAC
GGCGAATGGC ATACGTTTAC CTTGCACGGG GCCAGCCCAC TGCAATTGGA TGCACCCGTG
TGCCACATTA GTTTTTACGA AGCCGACGCT TATGCCCGCT GGGCTGGCGC GCGCTTACCC
ACCGAGGCCG AATGGGAGCA TGCCGTAGGC AATAGTGCAG TGGAAGGCAA TTTTGCCAAC
AGCGATTTAA TTATGCCAAA ACCACAAAAA TCAGCCGCCA ATAAAAATGT ACTTACAGGC
ATGTTTGGCG ATGTATGGGA GTGGACCGCC AGCGCCTTTG CCCCCTACCC TAACTTTAAA
GTTGCCGAAG GGGCAGTAGG TGAATACAAC GGTAAATTTA TGAGCGGCCA GATGGTATTG
CGCGGTGGGT CTTGCGCCAC ACCACCTGAC CATATGCGCG CCAGTTACCG CAATTTTTTT
CACCCAGATA AGCGCTGGCA ATTTTCGGGC CTGCGTTTAG CGAAAGGTGC AAACTAA
 
Protein sequence
MTTKQDKSDT SAHVHAAHAH AILLERLLAV RKRTEQLCQT LSDADATAQS MDDASPAKWH 
LAHTSWFFEE FLFAPALGEQ ARFHPRYSYL FNSYYDGVGN RHARPQRGLL TRPSLAEILE
YRAHVDNLFA TLFASLTPQQ FSLIELGIAH EQQHQELLLT DILHLFSHNP LAPALQPKVA
APLAAIASNT TPPLNWVTFE GGLIEIGATA THFCFDCELP KHKQYLTPFA LASRAVTNRE
WLAFINDGGY SNPLVWLSDG YATAVKNNWQ APLYWQQQAD GEWHTFTLHG ASPLQLDAPV
CHISFYEADA YARWAGARLP TEAEWEHAVG NSAVEGNFAN SDLIMPKPQK SAANKNVLTG
MFGDVWEWTA SAFAPYPNFK VAEGAVGEYN GKFMSGQMVL RGGSCATPPD HMRASYRNFF
HPDKRWQFSG LRLAKGAN
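
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method: the helper names (`clean`, `translate`, `gc_content`) are hypothetical, and it assumes that translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. Only the first 60 bases of the gene sequence are embedded here to keep the example short.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G base order.
# Assumption: table 11 shares these amino acid assignments with the
# standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(text):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna):
    """Return the percentage of G and C bases in the sequence."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "ATGACCACCA AGCAGGACAA AAGCGACACA TCGGCACATG TTCACGCGGC ACATGCTCAC"
dna = clean(first_line)

print(translate(dna))
print(round(gc_content(dna), 1))
```

Run on the first 60 bases, the translation should reproduce the first 20 residues of the protein sequence listed above. Pasting the full gene sequence into `first_line` would allow the same comparison against the whole protein and against the reported GC content.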