Gene Sde_2229 details


Gene Information

Locus tag: Sde_2229
Symbol:
ID: 3964833
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 2832761
End bp: 2833822
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 46%
IMG OID: 637921320
Product: hypothetical protein
Protein accession: YP_527701
Protein GI: 90021874
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
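
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2832761
END_BP = 2833822


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1062 353
```

Both results agree with the Gene Length (1062 bp) and Protein Length (353 aa) fields in the record.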


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00330354
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCCCAGT TTAGTCAATA TGAATGGACT TATTTAAGCC TACCCGCACG TTATTTTGCG 
TGGCGTATGC GTGGTAATGG GTTGAGTTGG ACCTACGAAA ATCATGTTGA TTTACACCAG
CCCTATGACT TGGTTATTGC AACGTCTATG GTTGATCTAG CCACATTACG TGGCTTAAAC
CCCCAGTTAC ATAATGTACC CGCGCTGCTT TATTTTCACG AAAATCAATT TGCTTACCCC
GTATCTAAGC AGCAGCCCGA TATTGTTGCT GCGCAAATGG TTAGCTTGTA TTCAGCGCTT
ACGGCGCAGC GTATTGTGTT TAATAGTGAG TATAACCGCT CCACTTTTTT CGACGGCTTA
GCAACGCTAC TTAAAAAGTT GCCCGATCAT GTGCCTAAAG GGATTGTGGA AGAGCTTACA
AATAAAAGCT CTGTGTTATT TGTGCCGCTA GCAAATGCAC CACAAGTGCA GGGTACGCGC
TTCAAGCAAG CAAGCGCGAT TCGCAATATA GTGTGGAATC ACCGCTGGGA ATACGACAAG
GGGCCGGAGC AATTATTGGC GTTTGCCACG GCTTTACCGC AAGGGTTGCC GATTAAAGTA
CATGTAGTAG GGCAACAGTT TAGGCAAATG CCCGAAGCGT TTGCGCAAGT GCGTCAATGT
TTGCAGGACA AAAAATATTT GGGCAAGTTT GGCTTTATTG CAAATAAAGC CGACTATATG
GAGTTACTAG GGCAGAGTGA TTTTGTGTTA TCTACAGCGT TGCACGATTT TCAGGGGTTA
TCTATTTTAG AAGCCGTTCA AGCTGGCTGT GTACCTATAG TGCCGAATCG CTTAGCGTAC
CAAGAAATAT TCGATGCGCA GTACAGGTAC CCCTCACATT TAGATAAGGC TGCCGAAGAG
GCTGTAGGGG TAATGGATAA ATTACAGCAA TTTTTAGCTA ACCCCACAAA GCAATTGCAT
GCACCTTCTG TTACCGAGTT AGAGTGGCGG ACGCTCAAGC CGGCTTACGA ACACATAATT
GAGCATTGCA GAGATTTAAA AACGGGGCGT CAGGGGGTAT GA
 
Protein sequence
MAQFSQYEWT YLSLPARYFA WRMRGNGLSW TYENHVDLHQ PYDLVIATSM VDLATLRGLN 
PQLHNVPALL YFHENQFAYP VSKQQPDIVA AQMVSLYSAL TAQRIVFNSE YNRSTFFDGL
ATLLKKLPDH VPKGIVEELT NKSSVLFVPL ANAPQVQGTR FKQASAIRNI VWNHRWEYDK
GPEQLLAFAT ALPQGLPIKV HVVGQQFRQM PEAFAQVRQC LQDKKYLGKF GFIANKADYM
ELLGQSDFVL STALHDFQGL SILEAVQAGC VPIVPNRLAY QEIFDAQYRY PSHLDKAAEE
AVGVMDKLQQ FLANPTKQLH APSVTELEWR TLKPAYEHII EHCRDLKTGR QGV
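
The protein sequence follows from the gene sequence by codon translation, and the GC content field can be recomputed from the same bases. The sketch below is a hedged illustration, not the database's own pipeline: it builds the codon-to-amino-acid mapping used by translation table 11 (which assigns the same amino acids as the standard code and differs in its permitted start codons), strips the space-separated 10-base blocks used in this listing, and stops at the first stop codon. It does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The helper names `clean`, `translate`, and `gc_percent` are made up for this example.

```python
# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing; upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT symbols
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGGCCCAGT TTAGTCAATA TGAATGGACT"))  # prints: MAQFSQYEWT
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared against the protein sequence and GC content fields of the record.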