Gene Sde_1421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_1421 
Symbol 
ID3966099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp1838804 
End bp1840531 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content45% 
IMG OID637920498 
Producthypothetical protein 
Protein accessionYP_526895 
Protein GI90021068 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000284545 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAACT TAGACACAAA ATTCCCCAAA CGCGCCTTAA GCGTATTAAT GGCTTCTGCT 
ACAACACTTG CTTTAATAGG CTGTGGAGGA ACTGGGCAAG ACGACCAAAC GCCTTCTACT
TCGTCTACGC AGTTTTCTGG GGTTGCTATC GACGGCGCGC TTGCCCGAGC GACGGTATAT
TTAGACTCCA ACAACAATGC CACGCGCGAC CCGTGGGAGG ACTATGCTTT TACCGATAAT
GATGGCTACT ACTCTTACAA CCCTAAAACA AATACAGATT ACTGTGCTAG CTCTGCGCCT
GCTAGCGAAA AAATATACTG CCTCAAATCG TCTCGATCAT TCTCCAACGT GGTTGTCCGC
GTCGACGGTG GCTACGACCT ACTTACCGGC GAGCCCTTTT TGGGGCAGAT GAGCCGCAGA
ATAGAGCTAG AAGAATCAAC AAGCTCTGTG GATAGTGTTG TCTCCCCGCT TACAACGTTG
ATGACAAATG TTGAGAGTAA CGATGATAGA AGTAACGTAT TAAATGCACT GAGGATTAGC
GAAAATGATT TAGATGTTAA TTATTTAGAC TCCGATGGTA CGGGCAGTGT AAATTCTAGC
TTGCTAAACA CGGCGCTTAA GGTGCATAAA ACAGTTACAG TTTTATCTGA CCGATTGAAT
GACAATTACG AAGAGTTGAA TAATGAAGTA GGCACAATGA ACGACCCAAG CTCTGAGGTT
TACCGCAGCT TAGCACAAGA GCTTACTGCC AATTCCGACA CAGGGTTAGA CAATATTCTG
CGAGATACAC AAGCTTTAAC ACGCATTATG GATAACTCTG AGAGCGTACT ACGGGACTTG
TACGAACAAA AGGAATTGGA TTTACCTGCA GATCTCGGGT CCTCCGAATC GCCCGATCAG
TTTACTCGAG TGGTGAACAT TACAAGCAAC ATACCCAATA TTGTAAACAG ACTAATAGAC
CCTATTAGCA CGATCGATAC TGGCGGCGCT CAGGGCCGCG CAAGAGCATT AGAAACGTTT
GTGATTAAAA GCGTAAACGA AGGGCGCAAC GACGACTCGT CTATCGACAA CGCGGTTAAT
TTTTTCACGA ATGAAGCTAA TACCACATTA GTTGATGCGT TAACATCTGC TTTATCGGGT
GAACGAGGGG ATCTTTCAGC GCTTTTAAAC AACGATTTTA GCGGAAGTGA TTTCGATAGC
GAAGAGGAGA TAAATCAAGC ATCGCGATTA GATGATAGCG CTCAACCCCT ATCGCTATTA
GCTGGTAGCA CCTTAAAAGT ATCAGATTTA GATTTAGGCA GCGCTCCCAA CGACCTAAAA
GATGCGGAAG TTGAATTTTA CTTTACGGGT GACACAGATG CCATTTCTGG TCAGTTTACA
TCCTGCGTAA AATTCATAGA CGGTGCGAGT ACCGATGGCA CACTAGGCGA AGGCAATAGC
CGCGGTGAGT TAGTGAACGG TTACTGGAGT ATGCTCGGAG CCAGTTCTGA CAACCGTTCT
TCATTTTCCG TACTGCTTAC CATCGAGTTC TTAGGCGCTA CATATCAAGC CATTCTTAAA
CCCAATGGAA CCGCAACAAT CGCCAACACC GAATATGAAC GAATTCGTTT TGATTTTGAT
GGTGAAATTA AAAATTGGTA CAGCGTAGAC GGATTAACCA CAACAGAGAT TGTACCTACG
TCGAATAAGG ATTGCGAAAC ACGCCTACCT TCGCGCGTAG GCATCTAA
 
Protein sequence
MKNLDTKFPK RALSVLMASA TTLALIGCGG TGQDDQTPST SSTQFSGVAI DGALARATVY 
LDSNNNATRD PWEDYAFTDN DGYYSYNPKT NTDYCASSAP ASEKIYCLKS SRSFSNVVVR
VDGGYDLLTG EPFLGQMSRR IELEESTSSV DSVVSPLTTL MTNVESNDDR SNVLNALRIS
ENDLDVNYLD SDGTGSVNSS LLNTALKVHK TVTVLSDRLN DNYEELNNEV GTMNDPSSEV
YRSLAQELTA NSDTGLDNIL RDTQALTRIM DNSESVLRDL YEQKELDLPA DLGSSESPDQ
FTRVVNITSN IPNIVNRLID PISTIDTGGA QGRARALETF VIKSVNEGRN DDSSIDNAVN
FFTNEANTTL VDALTSALSG ERGDLSALLN NDFSGSDFDS EEEINQASRL DDSAQPLSLL
AGSTLKVSDL DLGSAPNDLK DAEVEFYFTG DTDAISGQFT SCVKFIDGAS TDGTLGEGNS
RGELVNGYWS MLGASSDNRS SFSVLLTIEF LGATYQAILK PNGTATIANT EYERIRFDFD
GEIKNWYSVD GLTTTEIVPT SNKDCETRLP SRVGI