Gene Sde_2031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_2031 
Symbol 
ID3967294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp2559095 
End bp2560714 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content46% 
IMG OID637921119 
ProductABC transporter ATP-binding protein 
Protein accessionYP_527503 
Protein GI90021676 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00115375 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCACC TAATAGTTAA AAATTTAACC ACCCACTTTT ATACACGCGA AGGTTTAACC 
ACCGCTGTAG ATAACGTTAG CTTTACCCTA GAAGCGGGCC AAATACTTGG CATAGTTGGC
GAATCCGGCT CGGGTAAATC TGTTGCTTGC TACAGTTTGC TAGGTTTAAT TCCTAGCCCA
CCGGGCAAAG TGGTTAATGG CAGCGCTCTG TTTAACGGCG AAGACCTTCT TACTAAAACA
GAAGCGGAAT TACGCAGTGT GCGCGGCAGA AAAATTAGCA TGATTTTTCA AGACCCTATG
ACCTCCCTCA ACCCGCACAT GCGTATAGGT GATCAACTAA TAGAAGCCTA TCGCTTGCAC
CACAAAAGCA CTAAAAAACA GGCAACCGAA AAGGCGATAC AGCTTCTACA AGAAGTGGGT
ATTAAAGACG CCGACACACG CATACGTGCC TACCCTCATG AATTTTCTGG CGGTATGCGA
CAGCGCGCCA TGATCGCCAT GGCTTTGATT ACCGAGCCAG AATTATTAAT AGCCGACGAA
CCAACAACTG CACTGGACGT AACCGTACAA GCGCAAATTC TACAGTTGAT TAAATCGATC
CAGCAGAAGC GACACCTAAG TGTCATTTTT ATTTCGCACG ACTTAGCTGT AGTTTCACAA
ATTGCCGACC AGCTTATCGT TATGAAAGAA GGTAAGGTGG TAGAAAGCGG TGCAACTGCG
AGCGTTTTTA GCGAGCAAAA GCACCCTTAT ACAAAAAAAT TAATTGCTGC TATTCCCAAT
AAAGCCAAGC AGGTTAAATA TACCGCTACC GAAACCAACC CTTTGTTAAC GGTTAATAAT
CTTTCCACCA GCTTCGCGCA AGAAACCACC AGTTGGTTTG GCAAAAAGGC CGCCCGTAAG
GTAGTGGTAA AAGATATTAG CTTTTCAATT CAGCAAGGCG AAATACTCGG GTTAGTTGGT
GAGTCTGGCT CGGGTAAATC TACCCTTGGT CGCAGTGTTA TTAAATTAAT TAACGCCGAT
AACGGCGAAA TAAACATAGA CCAACATTGC ATTCACACCT TGCAAGGTGA CAAGCTAAAA
CAAGCCCGCA AAGATTTTCA AATGATCTTT CAAGACCCGT ACGCGTCGCT TAACCCAAGG
TTAACGGTAT TTGACGCGCT GGCAGAACCG CTGCTTTTGC ACGGCATTGC CAACAAAACC
AATGTGGTGG AGAAAGTTAA CACCTTAATG GATGACGTAG GCCTTGCCCG TAAGTTTGTG
CGCAAATACC CCCATGAGTT TTCTGGTGGC CAGCGCCAGC GTATAGCCAT AGCTCGCGCC
TTAGCCCCAC AACCAAAGCT CATAATTGCC GATGAGCCCG TATCGGCGTT GGATGTAACC
ATCCAAGCAC AAATATTAGA GCTACTGCTT AACCTTACTC AAAAGCACTG CCTCGCTATG
TTATTTATTT CGCACGATTT AGCCGTTGTG CGCTACCTGT GCGACCGAGT AATGGTTATG
CACAACGGCA ACCTTGTAGA GCAAGGGCCT ACCGAAGACA TTTATAACCA GCCCACTCAC
CCTTATACCC AAACGTTAAT TAGCGCGATT CCAACTTTTA TGACACAAAA TATGCACTAA
 
Protein sequence
MSHLIVKNLT THFYTREGLT TAVDNVSFTL EAGQILGIVG ESGSGKSVAC YSLLGLIPSP 
PGKVVNGSAL FNGEDLLTKT EAELRSVRGR KISMIFQDPM TSLNPHMRIG DQLIEAYRLH
HKSTKKQATE KAIQLLQEVG IKDADTRIRA YPHEFSGGMR QRAMIAMALI TEPELLIADE
PTTALDVTVQ AQILQLIKSI QQKRHLSVIF ISHDLAVVSQ IADQLIVMKE GKVVESGATA
SVFSEQKHPY TKKLIAAIPN KAKQVKYTAT ETNPLLTVNN LSTSFAQETT SWFGKKAARK
VVVKDISFSI QQGEILGLVG ESGSGKSTLG RSVIKLINAD NGEINIDQHC IHTLQGDKLK
QARKDFQMIF QDPYASLNPR LTVFDALAEP LLLHGIANKT NVVEKVNTLM DDVGLARKFV
RKYPHEFSGG QRQRIAIARA LAPQPKLIIA DEPVSALDVT IQAQILELLL NLTQKHCLAM
LFISHDLAVV RYLCDRVMVM HNGNLVEQGP TEDIYNQPTH PYTQTLISAI PTFMTQNMH