Gene Sde_2985 details


Gene Information

Locus tag: Sde_2985
Symbol:
ID: 3967746
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 3789988
End bp: 3791991
Gene Length: 2004 bp
Protein Length: 667 aa
Translation table: 11
GC content: 46%
IMG OID: 637922082
Product: hypothetical protein
Protein accession: YP_528454
Protein GI: 90022627
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1429] Cobalamin biosynthesis protein CobN and related Mg-chelatases
TIGRFAM ID:
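
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Gene Information fields of this record.
length = gene_length(3789988, 3791991)
print(length)                  # 2004
print(protein_length(length))  # 667
```

Under these assumptions the computed values agree with the Gene Length (2004 bp) and Protein Length (667 aa) fields.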


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000400495
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00618409
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGAACC TTATCCGCTG TAATGCATTG CTTGCGGCAT TTGCACTTAC CCCATCGTTA 
GCCCTAGGCA CAACTTATAC AGTAGACACA GTCACAGAGC TTAAAACACG ACTAGAAAAT
GCCAATAACG GCGACATTAT TAGAATTGAC GGTAATGGCG GCAATAATGG CGTTTATAGC
TATTCGGGAA GCACCTACAC TATTTACACC ACCGATGGGT GGACAAACCT TGCCCCAATG
TTTCTTATTC GCGATGCAGC CAATGTCACC ATAGAAGCAC TTAACGCTGC GAAAAAGCCA
ATACTGAAAG GTAAAGATTA CACAGACAAA TATGTGTTTT ACGGCCAGAA GGTACCTGGG
CTAACCATTA AAAATATTAA ATTTACCGAT GCCCGTAAAG GCGTGGTGAT AGATAGGTCG
AACAATATCA CCTTTGAAAA TAACGAAATT TTTGAAATTG GCGAAGAAGG TTTCCACCTG
CGCGACGGTA GCGACAACGC CATCATTCGC AATAACCATA TTCACGATAC GGGCTTGGTA
AAAGTTGATC GCGGTGAAGC TATTTACATA TGCAACGATC GCAAAATGTG GAACCCCTAC
TATCAACCCA ACCCAGTAGA AGAAAACTAC CAACAAAATT GCGATAACGC ACATATTTAC
AACAATACTA TTGGCCCGAA TATTGGTAAC AACGGTATAG ACGTTAAAGA GGGCACTGTT
GGTGTGTATG TTGAAAACAA TACGTTTAAC ATGGCTTGGA CACGCAGCGA GCTAACCAGC
ACCAGTGCCT CCGCCCACCC AAACGTTGTA ATACACTTTA AAGGCACAGA AGGTGTAGCC
ACCGGCAATA CCTTTAACTT TGCCAACGCG CCTTCTTCTT TGTTTCGCGC AATAAGTGTA
GACAGGCAAC TTGATGACGA CCCCGCGCTA AATTTGATTC ACGGCTACAA CAACTGGGCA
GTTAATAATA CGCTTAATAA TGCTCGGTCT GGCGATTATG TTTTCCACGC CGCGAGTGAT
GGTGGGGCAA CTTACGGCTG TAACAACGGC GCGCCAGATT ACGCCCCCAA TCCCAAGTCT
GATCGTATTT ACAATTCTGT AAGCGGCGCC GGTTGTACAC CGCCTACTGC CCCCACAAGT
GGAAGTTCAT CGTCATCATC TGGGTCTTCT AGCAGCACAT CGTCAAGCAG TAGCTCGTCG
TCTTCTAGCA GCTCGAGTAG TTCATCTAGC TCATCGTCGT CTAGCAGTTC GTCAAGTAGT
GGCGGCTCAT CTAGCGGTGG CAATGTGGGC AAAGAATATA ATTGGAATAA CGGCGCAGAA
AGCACCGCGT TTAAACACGT TTCGCAAAAC GTTGCTAACG GCGTAATAAC GCTAACCCTA
AGCGCCGACA ACAACGACCC CTATATGCGC ATGTACAGCA CCAATATAAA TGCCGATGTG
TACACGCATA TTGAAATGCG CGTAAAAAAT AATACACCCG GTACAGCATG GCGCATGTAT
TTCGACCCAG CAGGCTCAGC TGGTGAAGGC GGGAACTCGG TAAGCTTTAC AGTTAACAGC
AATAACACTT GGCAAACAGT AACTATAGAT ATGACTGCAG ATGCCGACTG GCAAGGCACC
ATAGACCGGA TTCGTTTAGA CCCGCAAGGT TATGTAAATG GCACCATAGA CATAGATTAC
ATTAAAGTAA TCTCACCTAG CGGAGGCAAT AACGGCGGCC AAACACACTG TATTAGTTAT
AGTGGCACCA GCTTAACCGA ACTTACTTTA AACGATGCGA GTTGTATTAC AGTGGCTGAC
GGTTTAACAA ACAAAGAAAT ATCCATTGCC GATAGCGATG CAAATAGCTC GTGCGATATT
CGCGGCAGTG CTAGCTCTAA AGACGGCACT GGCTACCATG TAATAGATGG CAATTGGGAG
AAAATAAGCG GTGGCTGGAC TGGCACCGCT ATTCAGTTCG ATGTAAGCAA TAACTGTAAA
TACCTAAAGC TACGGGTTAG GTAA
 
Protein sequence
MKNLIRCNAL LAAFALTPSL ALGTTYTVDT VTELKTRLEN ANNGDIIRID GNGGNNGVYS 
YSGSTYTIYT TDGWTNLAPM FLIRDAANVT IEALNAAKKP ILKGKDYTDK YVFYGQKVPG
LTIKNIKFTD ARKGVVIDRS NNITFENNEI FEIGEEGFHL RDGSDNAIIR NNHIHDTGLV
KVDRGEAIYI CNDRKMWNPY YQPNPVEENY QQNCDNAHIY NNTIGPNIGN NGIDVKEGTV
GVYVENNTFN MAWTRSELTS TSASAHPNVV IHFKGTEGVA TGNTFNFANA PSSLFRAISV
DRQLDDDPAL NLIHGYNNWA VNNTLNNARS GDYVFHAASD GGATYGCNNG APDYAPNPKS
DRIYNSVSGA GCTPPTAPTS GSSSSSSGSS SSTSSSSSSS SSSSSSSSSS SSSSSSSSSS
GGSSSGGNVG KEYNWNNGAE STAFKHVSQN VANGVITLTL SADNNDPYMR MYSTNINADV
YTHIEMRVKN NTPGTAWRMY FDPAGSAGEG GNSVSFTVNS NNTWQTVTID MTADADWQGT
IDRIRLDPQG YVNGTIDIDY IKVISPSGGN NGGQTHCISY SGTSLTELTL NDASCITVAD
GLTNKEISIA DSDANSSCDI RGSASSKDGT GYHVIDGNWE KISGGWTGTA IQFDVSNNCK
YLKLRVR
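
The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons. The following is a minimal sketch, not the pipeline used to produce this record. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; this sketch does not model alternative start codons, which is harmless here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI layout); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and final 24 bases of the gene sequence in this record.
print(translate("ATGAAGAACC TTATCCGCTG TAATGCATTG"))  # MKNLIRCNAL
print(translate("TACCTAAAGC TACGGGTTAG GTAA"))        # YLKLRVR
```

The two fragments reproduce the first ten and last seven residues of the protein sequence listed above.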