Gene Sde_3999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3999 
Symbol 
ID3967418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp5031922 
End bp5034000 
Gene Length2079 bp 
Protein Length692 aa 
Translation table11 
GC content46% 
IMG OID637923096 
Product3-phytase 
Protein accessionYP_529466 
Protein GI90023639 
COG category[I] Lipid transport and metabolism 
COG ID[COG4247] 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.739882 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGTTCTT GCTTTATAAG CGTGAACGAT TGGGAGAAAA CAGTGAACCA TCAACACAAT 
AAAAACCATA CAACTACAAA GAACTTAGTG CGCCGCAGTA CTTCCATTGC ACTTATATGT
GCAATTACTG GCCTATTGGG TTGCAACAGC ACAAGCCAAA AAACACTTTA CGATACAACG
GATAACCAAC CCTTGTATGC GCAGAACGCC AGCGTATCGG TTAGTAATAT TAAAGCGGGT
AATAAACTTA GTGGGCTAGC GGTAGTAACC CACAAGGGCC GCCAAGTAAT GGCGATAGCA
TCAGAAAAAA CAGGTATAAT CCTTGCGGCT ATTAATAGCG ATAGCCAAGC ACTTAACACA
CAGGTAATTA GCCAATTAAA AGGCAGCTAT GAGCTACTTG ATAGCAGGCA AACAAGTCAG
CGCCAATGGC TAATTACAGC CGATGCGCAA ACTGGGCAGC CGGTAATATT CAGTGCAACA
CAAGATAGCG ATTTAATGTC TGCAACTGCA ACAGCACTGC GAGAGACGCG TTTTCAAACC
GATGCGCTGT GTTTGTATTT AGATAAGCAA AATAATTTGT TCGCATTTAT GCTCGATGGC
TACGGCGGTG GCGAAATGCG CTGGTTGTGG GATGCACGCA AAGATTCACT GGTAGATATT
ACCGTTAAAC AGCTTAGTTT GCCGCCGGGG TCTGAATCTT GTGCCGTAGA TGATGCAAGC
GCAGCGCTAT TGGTTGCGGA AGAAGAGTTT GGTGTGTGGC AATACCCCGC AGAGCCAGAA
GGCGCTTGGC AGCGCCACTT AGTTGCTGCA GTAAAACCTT GGGGCAAAGT ACAAGCAAGC
CCTATAGGTG TAAAAGCCAT TGCGCCCTAC CACTTTGCCT TGTTTAGTGA GCAAGGTGCG
GCAGTGTATC GTTTAAATAA TAAAGGTACG CATGCGGAAG TAGTTGCTCT GGGTAAATTA
AACGCGCAAG AAATAGCAGA TGTAGTATGG CTAAACGATA AACTACTCCT ACTCGATGAA
GCGGAAGATG CTATTGTTAG CGCACCCTTA GCCGCTAAAA AAATAGAAAA GTCTATTGCC
ACTGCTAAAC AGCAATTGGC CCCACTACCT GCGGTTTACC CCACAGTAAC AGCCAAAGCG
CAAACCCCCG CAATGCAAAG GCGCGGTGAT GCAGCAGATG ACCCAGCTAT ATGGGTGCAC
CCAACCCATC CAGAAAAAAG CTTGGTTTTG GGTACCAATA AAAAATGGGG TTTATTTGTT
TACGACTTAC AAGGCAACGA AACACAGGCT ATAGCTACCG GCCATATTAA TAATGTTGAT
ATTCGCCAAG GCGTACGTTT AGCGCCTAAC CAAAAAGCGC AAGATATTGC CATTGCCAGC
AACCGTAGCG ACAACACGTT AACGGTATAT ACGCTTAACA ACGGCCATGT AAAACAAGTG
GCAAATATTG CCACTGGGTT AAATGATGTC TACGGCGTGT GTTTATATGC ACCTAACAAG
CAAGCATTGT ATGCGTTTAT TAACGATAAA GACGGCCGCT TTAAGCAGTA CCAATTAATC
GAAAATACGG CGGGTATAGA TGCAAACTTG GTGCGTGAAT TCCATTTAGA TAGTCAGCCA
GAAGCGTGTG TTGCCAACGA TGCAACCGGC GAGCTATTTA TCGGTGAAGA AGATGCCGGT
GTGTGGTTAT TCGACGCCAA CCCAACAGCA AGTATTAGCG GCCGCTTAAT AGCCGCGGTG
GGTGATGTGT TGGTGGCCGA TGTAGAAGGC TTAGGTTTAA TAAACAATGC ATTGGGTAAT
TATTTGGTTG TTTCTAGCCA AGGTGATAAT AGCTACGCTA TTTACCAAGC AGAGGCGCCT
TACAATTACG TAGGTTCCTT CCGCGTGGGC TTAAACAGTG ACAACCAAAT AGATGGCACA
TCGGAAACCG ACGGCATAGC TATAACCGGT GCAGCATTAG GGGAACACTA CCCGCAGGGC
CTATTAGTTA TTCAAGATGG CTTTAACCTA ATGCCTAGTC AGCCGCAAAA CTTTAAATAT
GTAAGTTGGC AAGATGTAAT CACCGAGTTG GCTAAATAG
 
Protein sequence
MCSCFISVND WEKTVNHQHN KNHTTTKNLV RRSTSIALIC AITGLLGCNS TSQKTLYDTT 
DNQPLYAQNA SVSVSNIKAG NKLSGLAVVT HKGRQVMAIA SEKTGIILAA INSDSQALNT
QVISQLKGSY ELLDSRQTSQ RQWLITADAQ TGQPVIFSAT QDSDLMSATA TALRETRFQT
DALCLYLDKQ NNLFAFMLDG YGGGEMRWLW DARKDSLVDI TVKQLSLPPG SESCAVDDAS
AALLVAEEEF GVWQYPAEPE GAWQRHLVAA VKPWGKVQAS PIGVKAIAPY HFALFSEQGA
AVYRLNNKGT HAEVVALGKL NAQEIADVVW LNDKLLLLDE AEDAIVSAPL AAKKIEKSIA
TAKQQLAPLP AVYPTVTAKA QTPAMQRRGD AADDPAIWVH PTHPEKSLVL GTNKKWGLFV
YDLQGNETQA IATGHINNVD IRQGVRLAPN QKAQDIAIAS NRSDNTLTVY TLNNGHVKQV
ANIATGLNDV YGVCLYAPNK QALYAFINDK DGRFKQYQLI ENTAGIDANL VREFHLDSQP
EACVANDATG ELFIGEEDAG VWLFDANPTA SISGRLIAAV GDVLVADVEG LGLINNALGN
YLVVSSQGDN SYAIYQAEAP YNYVGSFRVG LNSDNQIDGT SETDGIAITG AALGEHYPQG
LLVIQDGFNL MPSQPQNFKY VSWQDVITEL AK