Gene Sde_3039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3039 
Symbol 
ID3967703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp3885672 
End bp3886763 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content45% 
IMG OID637922136 
Producthypothetical protein 
Protein accessionYP_528508 
Protein GI90022681 
COG category[S] Function unknown 
COG ID[COG4299] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000596347 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCACAC AACGCTATTT GGCCCTCGAC GTTATGCGCG GGGCAACGCT CGCCATGATG 
ATACTTGTGA ACACCCCCGG CGACTGGGGC TTTGTTTACG CCCCCCTGCT ACATGCAGAT
TGGCATGGTG TCACCATTAC CGATTTTGTG TTTCCGTTTT TCCTTTTTAT TATTGGTTCG
GCGTTATTTT TTACTAGCCG TTCTAGCGGG CAGCTAGCCC CAGCAATTAA AGCTAAAAAA
ATAATTAAGC GTACAGCGCT GCTATTTACT ATTGGCTTAT TGCTGCATGC ATTCCCTTTT
ACTACGGCGC TTAGTGAGTT ACGCATACTA GGCGTATTGC AACGCATAGC GCTAGCCTAT
GGCATAGCGG CGTTTATTGT ATGGCTACCC ACCACGCAAC GGCTAATGGC GGCGCTAGGC
ATATTAGTAG CCTACTGGCT TGTATTTATA CTCACCGATA GCAGTTACCA TTTAGCAGAC
AATATTGTAA GGCACATAGA TATTACCATT TTAGGCGCAG AACACTTATG GCAAGGTAAA
GGCTTAGCCT TTGACCCAGA GGGCTTACTT AGCACCTTAC CTGCCGCCGT AAATATATTG
GCGGGCTTTG AAGCTACACG TTTATTGGTA AGCCAACCAG CTGGCGAGCC AAATAATGCC
ACCAGCCGCC AATTTAAATT GGCGCTGTAC GCCATGTGCA GTATTACTAT TGCATTAATT
TGGCACCGCT GGATGCCCAT AAATAAATCG CTTTGGACAA GCAGCTTTGT GCTGCTAACT
AGCGGCGTGG GTGTGCTAGT GCTTTTATTA TTAGTTAGAT TAGAACCTTA CCGCGCAACT
GCAGCTATTT ATCGCGCCTT CGCAATTTAT GGCCAAAACC CATTGTTTAT TTATGTATTA
TCTTCACTTT GGGTGCAGTG CTATTTTCTG TTTCATATAG ACGGCGTAAA TATTTATGCT
TGGCTGAATA ATCAACTGAA CTCAATTGCC GAACCTTATT TGGCAAGCTT GCTATTTGCT
CTGGGGCATG TCGCGTTGTT TTGGGGAGTG GCATACGCAT TACATAAAAA GCGTATTGTA
ATAAGTGTTT AG
 
Protein sequence
MATQRYLALD VMRGATLAMM ILVNTPGDWG FVYAPLLHAD WHGVTITDFV FPFFLFIIGS 
ALFFTSRSSG QLAPAIKAKK IIKRTALLFT IGLLLHAFPF TTALSELRIL GVLQRIALAY
GIAAFIVWLP TTQRLMAALG ILVAYWLVFI LTDSSYHLAD NIVRHIDITI LGAEHLWQGK
GLAFDPEGLL STLPAAVNIL AGFEATRLLV SQPAGEPNNA TSRQFKLALY AMCSITIALI
WHRWMPINKS LWTSSFVLLT SGVGVLVLLL LVRLEPYRAT AAIYRAFAIY GQNPLFIYVL
SSLWVQCYFL FHIDGVNIYA WLNNQLNSIA EPYLASLLFA LGHVALFWGV AYALHKKRIV
ISV