Gene Sde_0414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_0414 
Symbol 
ID3965990 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp510734 
End bp512110 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content48% 
IMG OID637919477 
Producthypothetical protein 
Protein accessionYP_525890 
Protein GI90020063 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTACAG GGTTGTTTAA GCGATTTGCA TTTATAGCGC GCGCGTTGCC ATCTAAATTG 
GCGGTAGGCT GCTTTTTATG GGGCGGCGCA ATTTTATTTT TGCTTACCAG TTTGTTTGCG
TTGCTAGAAA TAGAATCTAT TGAAAGCAGC GTAACCACGT TTGGTTTTGC GCTGCTGTTA
TTTTTTGTTG GCTTATTGCT GCTAGATTGG CTGGCGTGCC TTGCGCAAAA GCCCATTTTT
GTCGAGCGCG AATTGCCCCA CAGTATTGCC GTTAACCGCT GGACCAGTAT AACCCTTGTG
TTGCACCACC ATTTTGCAAG GCCGGTTAAG CTGCAACTGC ACGATGGGCT TGCCGATCAA
TTACTGTGTG AAGATATGCC GCTGCAGGTG CAATTGCGCC CAGGGCAATA CTCCAAGGTG
GCCTACAAAA TAAAACCTGT CGTGCGCGGC AATCACACAA TTACACCCTG TACCCTGCGG
GTAGAAAGTC CCTTTCGCCT TTGGTTTAAA CAGTACCAAG CGGGTGCGAG CAGCGAGATA
AAGGTATACC CAGATTTCGC CGCAATATCT GCCTTTACAA TAATGGCCAC CGAAAACCAT
ACCAGCCAAA TAGGTATTAA ACGCCGACCA AGACGCGGCG AAGGTATGGA GTTTTTACAA
CTGCGCGATT ATCGTCGCGG CGATTCGCTG CGACAAATAG ATTGGAAAGC CACTGCGCGG
CGCCGCGAAT TAATATCCCG CGAATACCAA GATGAACGCG ATCAACAAAT TGTATTGCTT
GTGGATAGTG GTAGACGCAT GCGCGCACTA GATGGCGAGC TTAGCCATTT TGACCACTCG
CTAAACGCCA TGCTGTTAGT AAGTTATATA GCCTTGCGCC AAGGGGATAG CGTAAGTGTA
ATGAGCTTTG GCGATGGCCA CCGCTGGATA CCGCCGCAAA AAGGCGCCGG CAAAATGAAA
ACCATATTAA ACGGCATGTA CGACTTAACC GCAGAAAACT GCGCGGCCGA TTACGTTGCC
GCGGCCGAGC AATTAGCTAT ATTACAGCGC AAGCGTTCGC TGGTTATTGT GGTTACTAAT
ACCCGCGATG AAGAATCCGA TGAGTTGGTA ATGGCGGTTA ACCTATTGCG CAAACGCCAC
GTGGTACTAG TGGCCAATAT TCGCGAAAAC ATTTTAAAAG AAATGCACGA TAGCGAAGTA
AAAGATATGG ATACCGCACT AGATTACCTC GCGGTAAACG ACTACATGCA AGCGCGCCAA
CAGGCCCAAA ACGAAATTCG TAACCAAGGT GTATACGCCA TAGATTGCCA GCCATCGGAA
TTGGCGGTGA AGGTGGCGAA TAGTTATATT GAGATAAAGA GGGCGGGAGT TTTGTGA
 
Protein sequence
MFTGLFKRFA FIARALPSKL AVGCFLWGGA ILFLLTSLFA LLEIESIESS VTTFGFALLL 
FFVGLLLLDW LACLAQKPIF VERELPHSIA VNRWTSITLV LHHHFARPVK LQLHDGLADQ
LLCEDMPLQV QLRPGQYSKV AYKIKPVVRG NHTITPCTLR VESPFRLWFK QYQAGASSEI
KVYPDFAAIS AFTIMATENH TSQIGIKRRP RRGEGMEFLQ LRDYRRGDSL RQIDWKATAR
RRELISREYQ DERDQQIVLL VDSGRRMRAL DGELSHFDHS LNAMLLVSYI ALRQGDSVSV
MSFGDGHRWI PPQKGAGKMK TILNGMYDLT AENCAADYVA AAEQLAILQR KRSLVIVVTN
TRDEESDELV MAVNLLRKRH VVLVANIREN ILKEMHDSEV KDMDTALDYL AVNDYMQARQ
QAQNEIRNQG VYAIDCQPSE LAVKVANSYI EIKRAGVL