Gene Sde_3022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3022 
Symbol 
ID3967744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp3858012 
End bp3859946 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content44% 
IMG OID637922119 
Producthypothetical protein 
Protein accessionYP_528491 
Protein GI90022664 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000919721 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATAAAAA TAAACAGTAA TCAGTCTTTT CTAGTTTCTA TGCTCGCACC AGTAGTGCTT 
GCATTTAGCG CAATAACATA TGCAGCAGAG CCCGCCTCAG ACCGTATTTA TATGGGAGCT
TTTACAGACC GCAATTGGTC TGAATCCAAT CCGTGTGGAG ATAAACAAAC CGAGCACTGG
GAGTGGAATC AAAATTCTCA ATGGTTTGGC AACACCAATG CTTACTTTGG GAAAATATTT
CCCGAATGGT TTTTACCTGA TGGAGCGCCC AATATCTATC AAAACTGGAA GGTAGAGGTA
GAAAGAATTA AGGCACAAGG TCGTATCCCT TACATAAACT GGGAGGCCCA TGGCCAAACG
CAAGATGGTG AATGCTGGCA CGGTGGCAAT GGGCAGAATG CCCGAGCGGA TAGAGATGTA
ATATCTGAAA TTAATGCCGG TCAGCACGAC GACTTAATAG AAAATTTAGC CATAGGTTTA
CGGGAAGAAA AAACAAAAAT AGTTATCGAT TTATTTCACG AAATGAACGG CGCATGGTAC
GACTGGTCGC CTTCCTGTAA GCATTCTGCG GGTTGGCCGG CTTGGCGCCA AGCGTTTAAG
CATGTGGTTG ATATATTTAG AGCAAACAAT GCCTACAATG TTGAATTTGG TACATCTGTT
TGGTTTGGCG CAGATATTTG TGGCAACGGC ATACAAGACA CAGTCGATAA CATTAATATA
CCGGGCTACT TAGATTGGAT AGGCATTGAT ATTTATGGCG ACGTCTACGG CAACAGCTTT
TCTAACCTAA TGGACCCATG GTATAGCGCA CTCGCGGGTA CAGGGTTGCC CATCGTCGTT
GGCGAAATGG GAGTGGCGCT ATCAGATGAT GACGAAGCGA AGCGCACGTG GATGACAGAA
TTCCGTGATT CACTAGTCAA TCAATACCCG CGTGTAAAAG CGTTTAACTG GTTTGATATT
GATAAGGGGT GGGTAAATAA CGAGGCCAAT TACCATATAG AGTCAGGTAA TGCCGGGCCT
CATTTCAATC AATTAATGAC CGACCCTCGT TTTGTTGGCT CGTTTGGTTC GGTATTTATT
AAAAATAAAG CAAACAATAA ATGCCTCGCA ATTGCCGCTA ACAGCAATTC AAACACTGCC
AATGTTGAAG CTGCAGATTG TAATGGTGCT GCAAACCAAG TGTTTACTGT AACGGATCTG
GGCGACTTTA AACTGGAGCT TCGGGCGCTT AACAGCGATA AGTGTGTAGA TGTGGCTGGC
GCAGCTTCCG GTGATGGTGC CAACATCCAG CAGTATAGTT GTAATGCGAG TAAGGCCCAA
GTTTGGGTGA TGAATGTTAT TAATTGGTCT ACGATGGAAG TAGAGCTTTC TGCAGAAAAC
TCAGAGAAGT GTTTAGATCT AGATATGGCT TCATCTAATG TTCAGCAGTG GCAGTGTTAC
TCAAACATAA ACCAACGCTG GTTTTTAAAA ACAACCCCTA ACGCACCGGT TCAATTAGAT
CCGCCAAGCA TTAGCGGTAT CTACGAGCTA AAAAATACTG CAACTAATCG TTGTTTAGAT
ATAGCGGGCG GTTCACCCGA TACCGAGGCG AATGCACAAA CATGGTCTTG CAATGGTTCC
AGTGCACAGC GTTTTAAAAT TGTTGATCTT GGTGGCTATA GGCTTCTACT TCGGCCGCTA
AGTAATACTA ACGTCTGCAT AGATGTTTCT GATGTAAGTC AAGCTAATGG CGCTAATGTA
CATCAGTGGA CGTGTAATGG TGGCAATAAT CAGACGTGGA AGGTTGATGT TAAGAACTGG
GCTACCATGG AAGTAACGCT TTCTGCTGCG CATTCAAACA AATGCCTCGA TGAAGATCAG
GGCTCTAATA ATGTACAGCA GTGGGCGTGC AGTAATGCTA ATAACCAAAT TTGGAAATTA
GATGTTGTGA ATTAA
 
Protein sequence
MIKINSNQSF LVSMLAPVVL AFSAITYAAE PASDRIYMGA FTDRNWSESN PCGDKQTEHW 
EWNQNSQWFG NTNAYFGKIF PEWFLPDGAP NIYQNWKVEV ERIKAQGRIP YINWEAHGQT
QDGECWHGGN GQNARADRDV ISEINAGQHD DLIENLAIGL REEKTKIVID LFHEMNGAWY
DWSPSCKHSA GWPAWRQAFK HVVDIFRANN AYNVEFGTSV WFGADICGNG IQDTVDNINI
PGYLDWIGID IYGDVYGNSF SNLMDPWYSA LAGTGLPIVV GEMGVALSDD DEAKRTWMTE
FRDSLVNQYP RVKAFNWFDI DKGWVNNEAN YHIESGNAGP HFNQLMTDPR FVGSFGSVFI
KNKANNKCLA IAANSNSNTA NVEAADCNGA ANQVFTVTDL GDFKLELRAL NSDKCVDVAG
AASGDGANIQ QYSCNASKAQ VWVMNVINWS TMEVELSAEN SEKCLDLDMA SSNVQQWQCY
SNINQRWFLK TTPNAPVQLD PPSISGIYEL KNTATNRCLD IAGGSPDTEA NAQTWSCNGS
SAQRFKIVDL GGYRLLLRPL SNTNVCIDVS DVSQANGANV HQWTCNGGNN QTWKVDVKNW
ATMEVTLSAA HSNKCLDEDQ GSNNVQQWAC SNANNQIWKL DVVN