Gene Sde_0122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_0122 
Symbol 
ID3967576 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp152795 
End bp155086 
Gene Length2292 bp 
Protein Length763 aa 
Translation table11 
GC content50% 
IMG OID637919181 
Productinter-alpha-trypsin inhibitor domain-containing protein 
Protein accessionYP_525598 
Protein GI90019771 
COG category[R] General function prediction only 
COG ID[COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACACAC ACCGCAAATT ACACCTAATG AACAAGCTTA AAAAAGCGCG CGCCGAGGGC 
ATAGCGTGGT TTTTTTATGC GCTGTTGGTA TTTTCTTTTA GCCTGTACAC CACCCAAAAA
GCCAACGCAG GTGAAGAGCC GGCATCTGGC CAATTAACGC TTGTGGATGC CACGGGCAAC
AGCTTGGATG CGCTGCATTT ATCCACCCAT GTGGATATGC AAATTAACGG CTTAATAGCC
AAGGTAACGG TAGAGCAGGC CTTTACCAAT AACAGCGATG AGTGGCGCGA AGGGGTATAT
GTGTTCCCCT TAGATGAACA GGCCGCAGTT AACGCCATGG AAATGGTAAT AGGTGATCGC
CGTATAAAAG GCGAAATTAA AGAAAAAGAG GTAGCCGAAA AAATTTATCA ACAGGCTAAG
GCCGAGGGTA AAAAGGCGAG CTTGGTATCG CAGCAACGAC CAAACCTGTT TACCCAAAAG
GTAGCCAATA TACCGCCGCG CGAAACCATT AGCGTAAGCC TTACCTACAC CCAAAGAGTG
GAATACCACA GTGGTCAGTT TGGTTTGCGT TTTCCGCTTA CACTTACCCA GCGCTACATA
CCTAATAGTG CAAATTTAGA GACGAACGTC GTCGAAAATA CGAAAAATTG GGACGATGAA
AGATGGGAGA ACTCAGCGCC AGATACGGCA GAAAAAACGC CGACTAGCAT AGACCTGGCC
GCTGGCGGTT ACGGCTGGCA GAGCTTTAAC CCCATAATTC ACACCCAAAA ACCCACCCCA
CAGGTGCCTG ATGCACACTT AATATCGCCG CCGATGGTAT TGGCCCAAGG GCAGTATGGC
GACGGGCAGT ACGAACAGAC CGGCAAAGAT AACCGCGCAA CCATAAGTAT TCAATTAGAT
GCGGGTTTTA ACGTGGCTAA CATCGAATCG CTGTACCACC AAATTACCAT TAACAAACCG
CCCAGCAGCG CCTACAACGT AGAGCTAACC AATGGCAGCA CTCTTATGGA TAGGGACTTT
GTATTGCAGT GGCGCGCAAC GGCAAGCAGT GCACCACAAG CTGCAGTATT TAAAGAAACA
CTAGCAGGGG AAGACTACCT ATTACTTATG TTGCTGCCGC CCCAAGGCCA ACAGCAACAC
ACGCAAAGCT TAAGCCGCGA CATTGTGTTT GTTGTGGATA CCTCTGGCTC TATGCAGGGT
ACTTCTATAC AGCAGGCCAA ACGCAGCTTG CAGTTTGCCC TGCGCGGGCT AAACCCCAGC
GATACCTTTA ACATTATTGA ATTCGATACA AGCTTTAGCC GCTTTCGCTC GCGCCCGGTA
AGTGCCACGG CCAGCAATGT GCAGGCAGCG GTAAGCTGGG TAAATAATTT AAATGCCGAT
AACGGTACCG AAATGTACGC CGCGCTCGAA GAGGCATTCG ACCAACTAGC CAGCATCAAC
CCAAACGGTA CAGAAAATAG CAAAAGCAGT AATAACCTGC AGCAGGTAGT GTTTATTACC
GATGGGGCAG TAGGCAACGA ACAAGCGCTG CTTTCGCTTA TTCACCGCCG CTTAAACAAT
GCGCGTTTAT TTACCGTGGC TATTGGCTCG GCGCCCAACA GCTACTTTAT GCGCAAGGCG
GCCCAGTTTG GCAAAGGTGC CAATGTGTTT ATAGGTGATA CCGCCGAAGT AACCCATAAA
ATGAATGCGT TGCTGAGCAA ATTAAAAACC ACCTTAGTTA GCGATATTAA TGTGCAATGG
CCGCAACAGT CGGAGGTGTA CCCGCAGCGC ATCCCCGATT TATATGCAGG CGAGCCGCTA
TTACTTGCGG CAAAAACCAG CGGTGCTATG GGCACAATCG ACATAAGCGG CAACACGGCG
TTGCAGCCGT GGCAATCCCA ATTAACCATC AACCCGTACC ACAACAACAG TGGCGTGGCG
CAGGTGTGGG CCAAGAGTAA AATCGATGCG CTAGAAGATT CAAAAACAGA AGGCGCCAAC
CCGCAAGATG TACGTAAACA GGTGGTAGAT GTGGCGCTTA CCCACGCGCT TATTACCCCC
TACACCAGTT TTGTAGCGGT GGAAGAGTTA GTGTCGCGGC CCGCCCACCA GCCTGTGCAA
ACGCAGGCTG TAGCTAACCT TAAGCCGCAA GGCCAAACGG TGAGTTACCC CAAAACAGCA
ACCTCCGCCA CGTTTAACTT AGTGCTGGGT ATTGGTTTGC TACTGCTGGC GTTTGCCCTG
CAGTTACGCA GTATTATTCG CGCGTTGTTA CACAGCTTTG CGCCCAGCGA ATGCAGCGGG
GTGAACGTGT AA
 
Protein sequence
MNTHRKLHLM NKLKKARAEG IAWFFYALLV FSFSLYTTQK ANAGEEPASG QLTLVDATGN 
SLDALHLSTH VDMQINGLIA KVTVEQAFTN NSDEWREGVY VFPLDEQAAV NAMEMVIGDR
RIKGEIKEKE VAEKIYQQAK AEGKKASLVS QQRPNLFTQK VANIPPRETI SVSLTYTQRV
EYHSGQFGLR FPLTLTQRYI PNSANLETNV VENTKNWDDE RWENSAPDTA EKTPTSIDLA
AGGYGWQSFN PIIHTQKPTP QVPDAHLISP PMVLAQGQYG DGQYEQTGKD NRATISIQLD
AGFNVANIES LYHQITINKP PSSAYNVELT NGSTLMDRDF VLQWRATASS APQAAVFKET
LAGEDYLLLM LLPPQGQQQH TQSLSRDIVF VVDTSGSMQG TSIQQAKRSL QFALRGLNPS
DTFNIIEFDT SFSRFRSRPV SATASNVQAA VSWVNNLNAD NGTEMYAALE EAFDQLASIN
PNGTENSKSS NNLQQVVFIT DGAVGNEQAL LSLIHRRLNN ARLFTVAIGS APNSYFMRKA
AQFGKGANVF IGDTAEVTHK MNALLSKLKT TLVSDINVQW PQQSEVYPQR IPDLYAGEPL
LLAAKTSGAM GTIDISGNTA LQPWQSQLTI NPYHNNSGVA QVWAKSKIDA LEDSKTEGAN
PQDVRKQVVD VALTHALITP YTSFVAVEEL VSRPAHQPVQ TQAVANLKPQ GQTVSYPKTA
TSATFNLVLG IGLLLLAFAL QLRSIIRALL HSFAPSECSG VNV