Gene Sde_3209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3209 
Symbol 
ID3965682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp4085115 
End bp4086335 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content46% 
IMG OID637922306 
Producthypothetical protein 
Protein accessionYP_528678 
Protein GI90022851 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGATT TTTACCAGAA CGGTATAATT ACAACATTAC ATAATTTAAA ACAGCGCCCA 
CTAGAGGCCA TGGAAGACGA ACTGCGAAAA TTTTCCAAAA CTCGCCCTAT GGCCCTCGTT
TTACCTTCGC TATTCTCCGA GCTAGAAGGC GATGCACTGC CCAATATTGT TAACGAACTA
AGTAAAGTAG ATTACCTACA AGAAATAGTT ATTGGCCTAG ATCGCGCAGA CGAAGATCAA
TACCGCCGCG CATTAGAGTT TTTTAAACCG CTCAAACAAA ATTTTAAAGT ACTGTGGAAC
GACGGCCCAC GCCTGCGCGC CATCGACCAA CGCTTAAAAG ACGAAGGCCT AAGCCCTATG
GAAGCCGGCA AAGGACGCAA CGTATGGTTT TGCTTGGGCT ATGTATTGGC AAGCGGCCAA
TCGCAATCGG TAGCTCTGCA CGATTGCGAT ATAGTTACCT ACGATCGCAG CTTACTCGCG
CGCCTTATTT ACCCCGTAGC CAACCCCAGC TTTAACTACG AGTTTTGCAA AGGTTTTTAC
GCCCGCGTTG CCAACGGCAA AATTCACGGC CGCGTTAGCC GCTTATTGGT TACGCCATTA
ATTCGCGCCT TGAAAAAAAC ACTGGGCCAC TACGATTATT TAGATTACAT CGACAGCTTC
CGCTACCCAC TTGCGGGGGA GTTTTCGTTC CGCACCGATG TAATTACCGA CATTCGCATC
CCCAGTGATT GGGGCTTAGA AATAGGCGTG CTTTCGGAGC TTAATCGCAA CTATGCCAAC
AACAGAATAT GCCAAGCAGA TATTGCCGAC ACCTACGATC ACAAGCATCA AGATCTCTCG
GCAGAAGACG CAGAAAAAGG CTTATCCAAA ATGTCTATCG ACATATCAAA AGCCCTATTC
CGCAAGCTTG CCACCAACGG CGTAGTGTTT AACTCAGAAA CATTTCGCTC TATTAAAGCC
ACCTACTTCC GCATAGCGTT AGATTTTGTA GAAACCTATT ACAACGATGC AGTAGTAAAC
GGCCTTAAAT TAGATATTCA CAGCGAAGAA CGCGCAGTAG AATTATTTGC CCGCAATATT
TTAGAAGCCG GCAAGCGCTT TCTTTCCAAC CCAATGGAAA AACCATTTAT TCCCAGCTGG
AACCGCGTTA CCAGTGCAAT ACCCGGCATT TTAGAAGACA TAAATGCAGC GGTAGAAGCC
GATATGGCAG ATTTTCAATA A
 
Protein sequence
MGDFYQNGII TTLHNLKQRP LEAMEDELRK FSKTRPMALV LPSLFSELEG DALPNIVNEL 
SKVDYLQEIV IGLDRADEDQ YRRALEFFKP LKQNFKVLWN DGPRLRAIDQ RLKDEGLSPM
EAGKGRNVWF CLGYVLASGQ SQSVALHDCD IVTYDRSLLA RLIYPVANPS FNYEFCKGFY
ARVANGKIHG RVSRLLVTPL IRALKKTLGH YDYLDYIDSF RYPLAGEFSF RTDVITDIRI
PSDWGLEIGV LSELNRNYAN NRICQADIAD TYDHKHQDLS AEDAEKGLSK MSIDISKALF
RKLATNGVVF NSETFRSIKA TYFRIALDFV ETYYNDAVVN GLKLDIHSEE RAVELFARNI
LEAGKRFLSN PMEKPFIPSW NRVTSAIPGI LEDINAAVEA DMADFQ