Gene Sde_3892 details

Gene Information

Locus tag: Sde_3892
Symbol:
ID: 3967117
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 4912509
End bp: 4914161
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 47%
IMG OID: 637922989
Product: hypothetical protein
Protein accession: YP_529359
Protein GI: 90023532
COG category:
COG ID:
TIGRFAM ID:
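
The start, end, gene length, and protein length fields above are mutually consistent, which a short calculation can confirm. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4912509
end_bp = 4914161

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1653 550
```

Under these assumptions the computed values match the recorded Gene Length (1653 bp) and Protein Length (550 aa).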


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0718789
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATCAT GTTGTATCAA GTTTTTTACC ACACTCTGTA CTGCTGTTTA TGTACTGGGT 
TGTGGCGCGT TAGCGCATGC GCAAACTGGC CCGGCGGGTT ATAACTACGC TGCTGCCGAA
AACGAAACGG TGTATTTAAA CGGAACCACC AACGTTGCCT ACGGCGCGAA TGGCTCGTTT
TATTACGCTT ACAATCAAAC CGGTTCGGTT AATTGCTCTA ATCAAACCTT TGGCGACCCT
ATTTTTGGGG TGCGCAAAGC GTGTTACACC CAGCAAGTGG CGAGCAATAA CCCGCCGTCG
GTTTCGTTTG CTAGCCCAAC GGGCAACCTA ACAGTCGACG AAGGTTATGC TCTTTCTGTA
ACCGTGAACG CCAGCGATAG CGACGGCAGC ATTGCCAGTG TAGAGCTGTT TATTAACAAC
CAATTAGTAC GACAAGAGCT TTACGCACCT TACGAGTGGG GCGCAGCTGC TGAGCCAGAC
GAATTAAATG GCCTACCCGT TGGTACACAC ACAATAAAAG CGGTAGCCAC CGATAACGAC
GGCGACACCA AACAAGCTAG CTTTAAGCTA ACGGTACGCG GTGCGGCGGT AGATGTTCCT
GGCTTGGTAC AAGCGGAGGA TTACACAGGC TTTTACGACA CAACCAATGG CAACACTGGT
GGTGCGTATC GCAACGACAA TGTAGACGTA GAAACCACAA GCGATAGTAA CGGTGGTTAC
GACGTAGGTT GGTTTGCGGC AAACGAATGG CTGGAATACC CCATTAACGT TACCGAAGCG
GGCAATTATG TATTAGAAGC ACGCGTCGCA TCTGCTGTAG GCGGTGGTAT GTTTACCGCA
GAAATAAATG GCAACAACAG CAGTACATTT AGCATAGGCA ATACCGGCGG CTGGCAAAAT
TGGCAAACCC TGAATAACAA TATTGGCAAT TTAAGCACCG GTAAAAAAAC GTTACGCATT
CAAGCGCAAA GCGGAAACTT TAATTTAAAC TGGCTGCGCC TCAAGCGTGC CACAACCTCT
GTGTGCACAC TTAATACTCC TGCCGAAAAC ATTCCAACGC CATTTAATTT ATTTACCGTT
ATCGATACAG ATTTAAACCG CTACGAATTC TGTAAAGCGT CTAAATGGTT TGAGGAATCT
AACGGTAAAC AAGTGTTTAA ATTATTTACC GGCGACAACC TAGCAGATAA CGTACCTGGC
GCCCGCGTAC ATGCTCGCAC AGAAGCTGGC CAAGGGCTTA AGTTTAAAGC AGGCTCCACA
TGGCACACCT TTGAAGCCAG AATGAAACCC AGCAAAAAGT TAGACTACAC TTACACCATT
GCTCAATTGT TTGCCGGCTG TTGCGGGCCG CAGTTGCGCA TTGAAGTAAA ATCTAACGGA
CGCATCCACA TGGGGTCGCG CGGTAACGGC AATATTCGTA TTAGTGACGA CCAAGATTAC
GCCAACGGCT CTAGATCGTT CAAAATTAAA ATTCGCACCA ATGGCGATCA GTTCGAAGTG
TATTTTAACA GCAGCAAAAA GTTCAGCGGC CGCACAGACG AAGCCAAGAA CGGCAATACC
AGTGCGCTTT ACCACTTCCG CTGGGGTGTA TATTCCAACG AAGTAATGAG CGAAGATTTA
TCTAACACTG TTACAGAAAT TATTCGAAAT TAA
 
Protein sequence
MKSCCIKFFT TLCTAVYVLG CGALAHAQTG PAGYNYAAAE NETVYLNGTT NVAYGANGSF 
YYAYNQTGSV NCSNQTFGDP IFGVRKACYT QQVASNNPPS VSFASPTGNL TVDEGYALSV
TVNASDSDGS IASVELFINN QLVRQELYAP YEWGAAAEPD ELNGLPVGTH TIKAVATDND
GDTKQASFKL TVRGAAVDVP GLVQAEDYTG FYDTTNGNTG GAYRNDNVDV ETTSDSNGGY
DVGWFAANEW LEYPINVTEA GNYVLEARVA SAVGGGMFTA EINGNNSSTF SIGNTGGWQN
WQTLNNNIGN LSTGKKTLRI QAQSGNFNLN WLRLKRATTS VCTLNTPAEN IPTPFNLFTV
IDTDLNRYEF CKASKWFEES NGKQVFKLFT GDNLADNVPG ARVHARTEAG QGLKFKAGST
WHTFEARMKP SKKLDYTYTI AQLFAGCCGP QLRIEVKSNG RIHMGSRGNG NIRISDDQDY
ANGSRSFKIK IRTNGDQFEV YFNSSKKFSG RTDEAKNGNT SALYHFRWGV YSNEVMSEDL
SNTVTEIIRN
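
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, together with a simple GC-fraction calculation of the kind behind the GC content field. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers, not part of any database tool, and the demo uses only the first two blocks of the gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces and line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence above, used only as a short demo.
demo = clean_sequence("ATGAAATCAT GTTGTATCAA")
print(len(demo))          # 20
print(gc_fraction(demo))  # 0.25
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` would give a value comparable to the GC content field, assuming that field is computed over the coding sequence alone.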