Gene Sde_0022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_0022 
Symbol 
ID3968155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp25495 
End bp26553 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content49% 
IMG OID637919081 
Producthypothetical protein 
Protein accessionYP_525498 
Protein GI90019671 
COG category[S] Function unknown 
COG ID[COG1652] Uncharacterized protein containing LysM domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.124597 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAT ACTTGCTTGG CGTGCTTTCT GCCATCGCCT TATCTGCGCA AGTCGCCACT 
CAAGCCTACG CGGAACAGCC TCAACTTCGC GATGAGATCC CAGCGACCCA TACAGTAGTT
AAAGGGGACA CGCTGTGGGA TATTTCCGCG ACCTTCCTTA AGAATCCTTG GATGTGGCCA
GAAATTTGGC ATGTAAATGC CCAAATTGAA AACCCGCACC TTATCTACCC TGGCGACGTA
ATTCGCTTGA TCTATGTAGA TGGCAAACCA CGTTTAACGC TCGATACCAG CGGCCGCGTT
TATAAAATGT CGCCTCAGGC GCGCGTTTTA TCTGCTGAAG AGGCCATTGA AACGATCCCG
CTCGAAAAAA TTAACAGCTT TTTGTCACGC AGCCGCGTGG TTGGCGAAAA CGATTTTGTA
GGCGCGCCCT ATGTGCTTTC TGGTTTAGAT CAGCACTTAT TGGTAGGCGC TGGCGATAAA
ATCTACGGTC GCGGCAATTT TGCCGAGCGC GGCACGGTGT ACGGTATTTA CCGTCAGGGT
GAAATCTTTA AAGACCCAGA AACCAAAGAG ATTTTGGGTG TACAGGCGCT CGATATCGCT
ACTGCATCAT TAATGCGTGT AGAAGACGAT AACGATGCAA AAGACGATAT TGAAATTGGC
ACCTTAAGTG TTTCTCGCAC CACAGAAGAA GTGCGTATCG GCGACCGCTT CTTGCGCCAA
GAAGAACGCC CCATCGACTC GACTTTCTTC CCATCGGCCC CTAACACCGA AACCGAAGGT
GTCATTTTGG CGGTTGAAGG CGGTTTAACC CAAGTGGGTA AAATGGACGT TGTTGTTATA
AACCGCGGCG AGCGCGAAGG CATGACAGCA GGCACGGTAC TTGCCGTTTA CAAGCGTGGC
GGTGTTATAC GCGACCGAGT GAGTAAAGAT AGAGTAACTT TGCCCGATGA GCGTGCCGGT
GTTTTGATGA TTTTCCGCAC CTTCGAGAAA GTAAGCTTTG GCTTAATATT AGAAGCGGAG
CGCGGCATTT CGGTAAAAGA TAAAGTACGC AACCCATAA
 
Protein sequence
MKKYLLGVLS AIALSAQVAT QAYAEQPQLR DEIPATHTVV KGDTLWDISA TFLKNPWMWP 
EIWHVNAQIE NPHLIYPGDV IRLIYVDGKP RLTLDTSGRV YKMSPQARVL SAEEAIETIP
LEKINSFLSR SRVVGENDFV GAPYVLSGLD QHLLVGAGDK IYGRGNFAER GTVYGIYRQG
EIFKDPETKE ILGVQALDIA TASLMRVEDD NDAKDDIEIG TLSVSRTTEE VRIGDRFLRQ
EERPIDSTFF PSAPNTETEG VILAVEGGLT QVGKMDVVVI NRGEREGMTA GTVLAVYKRG
GVIRDRVSKD RVTLPDERAG VLMIFRTFEK VSFGLILEAE RGISVKDKVR NP