Gene Sde_3149 details

Gene Information

Locus tag: Sde_3149
Symbol:
ID: 3965583
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 4024154
End bp: 4025332
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 51%
IMG OID: 637922246
Product: phosphoryl transfer system, HPr
Protein accession: YP_528618
Protein GI: 90022791
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
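
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(4024154, 4025332)
print(length)                     # 1179
print(protein_length_aa(length))  # 392
```

Both results agree with the Gene Length and Protein Length fields of this record.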


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGACC CGATTGTTAT CGTAGGTATA GCGCGCACAG CCATGGGCGG TATGCAGGGT 
ATGTTTTCTG ATGTAAGTGC ACCACAATTA GGCGCGCAAG CGATAAAGGG CGCATTAGAG
GACGCAGGCC TAGCTACGAG TGATGTAAGT GAAGTGTTTA TGGGTTGTGT TTTACCAGCC
GGCACAGGGC AGGCGCCAGC GCGGCAAGCG GCTTTGGGCG CAGGGCTTGA TAAAGGTGTA
CCCACTACCA CTGTTAACAA AGTGTGTGGC TCGGGCATGA AAACTGTGAT GATGGGCTGC
CAGGCTATTT TGGCGGGTGA GGCCGAGGTG GTTGTAGCCG GTGGTATGGA GAGCATGACC
AACGCACCCT ATATGCTTAC TAAGGCACGT GGCGGTTATC GCTTGGGGCA CGGCCAAGTG
ATGGATCACA TGTTTTTAGA TGGCTTGCAA GATGCGTACG AAGGTGAGTT AATGGGTAAC
TTTGCAGAAG CTACGGCAAG TAAGTATGGC TTTACTCGCG AAGCGCAAGA TGCATTCGCA
ATTGAATCGC TCGCGCGCGC TAATAAGGCG ATTAACGAAG GCAAATTTAA GCGAGAAATT
ACCCCCTTTA CGTTAAAAAC ACGCAAAGGT GAAACCGTGA TAGACACCGA CGAGCAGCCG
GGCAATGCGC GTCCAGATAA AATCCCAAGT TTGCGCCCAG CATTCTCAAA AGATGGCACT
GTAACAGCCG CTAACTCAAG CTCAATTTCC GATGGTGCGG CAGCATTGGT GTTAATGAAG
CAGAGCACCG CACAAGCAAA AGGGTTAAAG CCCATTGCCA AAATTACCGG TTATACGCAA
CACGCCCACG AGCCAGAGTG GTTTACTACC GCGCCAGTGG GAGCGGTTAA AAACTTGCTA
GAAAAAACAA ACTGGTCTGC AAGCGATGTG GACTTGTTTG AAATTAACGA GGCGTTCGCG
GTGGTAAGTA TGGCTGCGAT GCACGATTTA GAGTTGGACC ACGCCAAGGT AAATGTTAAT
GGCGGCGCTT GTGCATTGGG TCACCCGCTG GGTGCATCGG GTGCGCGCGT ATTGGTTACA
CTCATTGCAG CGCTGCAAAA CGAGAGTAAA AAGACTGGAG TTGCCGCCCT GTGTATTGGT
GGTGGTGAAG CGGTAGCAAT GGCTGTTGAG CTGATCTAA
 
Protein sequence
MSDPIVIVGI ARTAMGGMQG MFSDVSAPQL GAQAIKGALE DAGLATSDVS EVFMGCVLPA 
GTGQAPARQA ALGAGLDKGV PTTTVNKVCG SGMKTVMMGC QAILAGEAEV VVAGGMESMT
NAPYMLTKAR GGYRLGHGQV MDHMFLDGLQ DAYEGELMGN FAEATASKYG FTREAQDAFA
IESLARANKA INEGKFKREI TPFTLKTRKG ETVIDTDEQP GNARPDKIPS LRPAFSKDGT
VTAANSSSIS DGAAALVLMK QSTAQAKGLK PIAKITGYTQ HAHEPEWFTT APVGAVKNLL
EKTNWSASDV DLFEINEAFA VVSMAAMHDL ELDHAKVNVN GGACALGHPL GASGARVLVT
LIAALQNESK KTGVAALCIG GGEAVAMAVE LI
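
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup and of a GC percentage calculation. The helper names are hypothetical, and the short input is only the first two blocks of the gene sequence, not the full record.

```python
def normalize_sequence(block: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments drops spaces, tabs and newlines alike.
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in an already normalized DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First two blocks of the gene sequence, used here as a small example.
fragment = normalize_sequence("ATGAGTGACC CGATTGTTAT\n")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 40.0
```

Applied to the full gene sequence, the normalized length can be compared with the Gene Length field and the GC percentage with the GC content field.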