Gene Sde_3961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3961 
Symbol 
ID3967265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp4991562 
End bp4992683 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content44% 
IMG OID637923058 
Producthypothetical protein 
Protein accessionYP_529428 
Protein GI90023601 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.965049 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.287654 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAAT TAATAGAGGC ACCGGCTCCT TGGCCAGAAG CATTCGATGG TGAAATTATC 
CATCCAGAAC TTTACCTGTG GGATGCCTGG GGTTTTAGTG TTGCCGACGA GTTGCACTTG
TACTGTTTAG CTGTGCCTAA AAAAGCGATA GATGGCTCAC CTGTTGCTGC AAGCCAGAGA
AATAATTATC CCTTTCATGT TCGCCATTTT TATTCAGAAA ATTTAGGCAA AACGTGGTTA
GATAAAGGCG TTTTTCAACA GCCTAACAAT GCTAGTGACG GACACGATGC TCGCAATGTA
TGGAGTGGGT CTGTATTGCC CCTTTCTGAT GGTAGACTAG CAGTAGCGTA TACCGGTATA
CGTGAGCGCG GTAAAGAAAA ACCTTTTGTA CAAAACTTAG CAATTGGAAT TGCAGATAGC
GCGCAAACAA TGGGGGATAA AAGCGGTAAG GTGTTGTTTT GCCCAGAGTT ACACGAAGCT
TCTCTACGAG CAGCGGGTTA TTTCTTTGCT GAAAAAGACA AAATCGGTTT GGCTGGTGGC
GAAAATAATG GCCCAATTAC AGCGTGGCGC GATCCATTTT TAATTGCAGA TACACTCGAT
GAAAAGCAAC CATATAAACT CGTGTGGGCG GCAAAAAAAT CCGCCACTCA ATGCGCTTTT
GGTGCTGCGA GTATAAATCT AAGTAACGAA GATATATCGG CAACCCAGTT GTTTGGCCCT
ACAACATTGC CAGATGATGA CGAGTTTACT CAATTAGAGT TACCGCAAAT CTATGTGGAC
GAACTAAACA AACGCTATGT TCTCATCGCA GCGACTACTA CGCGAACAAG CGAAGCGCAA
AGCGAAAGTG AAGTGGATAA ACGTATACGC TTATATACTG CGCCAAGTTT AACTGGCCCT
TGGCAAAAGG CCGGTACGCA AACTAGCGAA GTGGACGGCT TAGAGAGTTT ATTTGGTATG
ACTGTTTTAA AAGCGGATTT CGAAAACGAT ACACTCTACT GCATGGCTCC GTACACTGAA
GCGACAGCCC CCGAGCAGAT ATTATCTTTT GCGCCAATAG TTAAAATAGA TTTGAATGAG
ATAGGCAGGT TGCAAAAAAT TTCTGCCAAA CCTGTTTACT AA
 
Protein sequence
MSKLIEAPAP WPEAFDGEII HPELYLWDAW GFSVADELHL YCLAVPKKAI DGSPVAASQR 
NNYPFHVRHF YSENLGKTWL DKGVFQQPNN ASDGHDARNV WSGSVLPLSD GRLAVAYTGI
RERGKEKPFV QNLAIGIADS AQTMGDKSGK VLFCPELHEA SLRAAGYFFA EKDKIGLAGG
ENNGPITAWR DPFLIADTLD EKQPYKLVWA AKKSATQCAF GAASINLSNE DISATQLFGP
TTLPDDDEFT QLELPQIYVD ELNKRYVLIA ATTTRTSEAQ SESEVDKRIR LYTAPSLTGP
WQKAGTQTSE VDGLESLFGM TVLKADFEND TLYCMAPYTE ATAPEQILSF APIVKIDLNE
IGRLQKISAK PVY