Gene Information |
Locus tag | Sde_1421 |
Symbol | |
ID | 3966099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | - |
Start bp | 1838804 |
End bp | 1840531 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637920498 |
Product | hypothetical protein |
Protein accession | YP_526895 |
Protein GI | 90021068 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000284545 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAACT TAGACACAAA ATTCCCCAAA CGCGCCTTAA GCGTATTAAT GGCTTCTGCT ACAACACTTG CTTTAATAGG CTGTGGAGGA ACTGGGCAAG ACGACCAAAC GCCTTCTACT TCGTCTACGC AGTTTTCTGG GGTTGCTATC GACGGCGCGC TTGCCCGAGC GACGGTATAT TTAGACTCCA ACAACAATGC CACGCGCGAC CCGTGGGAGG ACTATGCTTT TACCGATAAT GATGGCTACT ACTCTTACAA CCCTAAAACA AATACAGATT ACTGTGCTAG CTCTGCGCCT GCTAGCGAAA AAATATACTG CCTCAAATCG TCTCGATCAT TCTCCAACGT GGTTGTCCGC GTCGACGGTG GCTACGACCT ACTTACCGGC GAGCCCTTTT TGGGGCAGAT GAGCCGCAGA ATAGAGCTAG AAGAATCAAC AAGCTCTGTG GATAGTGTTG TCTCCCCGCT TACAACGTTG ATGACAAATG TTGAGAGTAA CGATGATAGA AGTAACGTAT TAAATGCACT GAGGATTAGC GAAAATGATT TAGATGTTAA TTATTTAGAC TCCGATGGTA CGGGCAGTGT AAATTCTAGC TTGCTAAACA CGGCGCTTAA GGTGCATAAA ACAGTTACAG TTTTATCTGA CCGATTGAAT GACAATTACG AAGAGTTGAA TAATGAAGTA GGCACAATGA ACGACCCAAG CTCTGAGGTT TACCGCAGCT TAGCACAAGA GCTTACTGCC AATTCCGACA CAGGGTTAGA CAATATTCTG CGAGATACAC AAGCTTTAAC ACGCATTATG GATAACTCTG AGAGCGTACT ACGGGACTTG TACGAACAAA AGGAATTGGA TTTACCTGCA GATCTCGGGT CCTCCGAATC GCCCGATCAG TTTACTCGAG TGGTGAACAT TACAAGCAAC ATACCCAATA TTGTAAACAG ACTAATAGAC CCTATTAGCA CGATCGATAC TGGCGGCGCT CAGGGCCGCG CAAGAGCATT AGAAACGTTT GTGATTAAAA GCGTAAACGA AGGGCGCAAC GACGACTCGT CTATCGACAA CGCGGTTAAT TTTTTCACGA ATGAAGCTAA TACCACATTA GTTGATGCGT TAACATCTGC TTTATCGGGT GAACGAGGGG ATCTTTCAGC GCTTTTAAAC AACGATTTTA GCGGAAGTGA TTTCGATAGC GAAGAGGAGA TAAATCAAGC ATCGCGATTA GATGATAGCG CTCAACCCCT ATCGCTATTA GCTGGTAGCA CCTTAAAAGT ATCAGATTTA GATTTAGGCA GCGCTCCCAA CGACCTAAAA GATGCGGAAG TTGAATTTTA CTTTACGGGT GACACAGATG CCATTTCTGG TCAGTTTACA TCCTGCGTAA AATTCATAGA CGGTGCGAGT ACCGATGGCA CACTAGGCGA AGGCAATAGC CGCGGTGAGT TAGTGAACGG TTACTGGAGT ATGCTCGGAG CCAGTTCTGA CAACCGTTCT TCATTTTCCG TACTGCTTAC CATCGAGTTC TTAGGCGCTA CATATCAAGC CATTCTTAAA CCCAATGGAA CCGCAACAAT CGCCAACACC GAATATGAAC GAATTCGTTT TGATTTTGAT GGTGAAATTA AAAATTGGTA CAGCGTAGAC GGATTAACCA CAACAGAGAT TGTACCTACG TCGAATAAGG ATTGCGAAAC ACGCCTACCT TCGCGCGTAG GCATCTAA
|
Protein sequence | MKNLDTKFPK RALSVLMASA TTLALIGCGG TGQDDQTPST SSTQFSGVAI DGALARATVY LDSNNNATRD PWEDYAFTDN DGYYSYNPKT NTDYCASSAP ASEKIYCLKS SRSFSNVVVR VDGGYDLLTG EPFLGQMSRR IELEESTSSV DSVVSPLTTL MTNVESNDDR SNVLNALRIS ENDLDVNYLD SDGTGSVNSS LLNTALKVHK TVTVLSDRLN DNYEELNNEV GTMNDPSSEV YRSLAQELTA NSDTGLDNIL RDTQALTRIM DNSESVLRDL YEQKELDLPA DLGSSESPDQ FTRVVNITSN IPNIVNRLID PISTIDTGGA QGRARALETF VIKSVNEGRN DDSSIDNAVN FFTNEANTTL VDALTSALSG ERGDLSALLN NDFSGSDFDS EEEINQASRL DDSAQPLSLL AGSTLKVSDL DLGSAPNDLK DAEVEFYFTG DTDAISGQFT SCVKFIDGAS TDGTLGEGNS RGELVNGYWS MLGASSDNRS SFSVLLTIEF LGATYQAILK PNGTATIANT EYERIRFDFD GEIKNWYSVD GLTTTEIVPT SNKDCETRLP SRVGI
|
| |
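
The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names (gene_length, expected_protein_length, gc_percent) are hypothetical helpers introduced here for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string.

    Whitespace is removed first, so the 10-base groups used in the
    'Gene sequence' field can be pasted in unchanged.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Values taken from the record: Start bp 1838804, End bp 1840531.
length = gene_length(1838804, 1840531)
print(length)                           # 1728
print(expected_protein_length(length))  # 575
```

Both results agree with the Gene Length (1728 bp) and Protein Length (575 aa) fields. Passing the full Gene sequence string to gc_percent gives a value to compare against the GC content field.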