Gene Sde_1014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_1014 
Symbol 
ID3967768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp1295551 
End bp1296969 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content48% 
IMG OID637920081 
Productarabinan endo-1,5-alpha-L-arabinosidase 
Protein accessionYP_526488 
Protein GI90020661 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3507] Beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000194163 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTTT GTCGAAAACT CTATCAACTA GCTACAACAG TAGTACTTCT ATGTATGTGC 
GCATTTGCCA ATGCCCAGTT ATCTAATGGC GTGTACTCCA TTACTTCTAA GTTAAGTGGT
AAGCCTATAG AAATAACGGG GGCGTCTACG GCGGCTGGCG CAAATGTAAT TCAGTGGGCG
AATAATGGCG GTGATCATCA AAAATGGATT GTCACTCACG AAGGCAATGG CGACTACTCC
ATAATTAACT TGTTAAGTGG TATGGCGCTG GAGGTGTTTG ATTTTTCCAC GGCAGATGGC
GGCAATGTTG TGCAGTATGA TTTTTGGCAT GGCGACCCGC AGTTGTGGAC TTTAAGCAGC
CAGGGCAATG GCTATTATGC CGTGCTAAAT AAACACAGCG GCAAAGCGTT AGATTTGTAT
GGTTTTGATA CGTCTAACGG CGCGAATATT GCGCAATGGG CCTTTTGGGG CGGGGACCCG
CAGCAGTGGC AATTTACCAA AATCGCCAAT GTAGGTGCGC CGCCAGTAGA TACATCTACC
ACCAACGGTG CAACCAACCA CTGGTCCTTA ACCGGTAATC TAGTGACTCA CGACCCCACA
ATGGCCTACG AAAACGGCTC ATGGTGGTTG TATCAAACCG GCGAGGGAAT TTACGGTAAG
TATTCAGCCA ATGGTTTGGC GTGGGATGGC TTACCTTCTG TGTTTCCCAA TGGTTTAAGT
TGGTGGAAGA CCTATGTACC CGGCCAGTCG AACAACGATG TATGGGCGCC TGATGTACGC
ACTTATAATG GGCGGGTTTA TTTGTACTAT TCCATCTCTA CTTTTGGCTC GCGTGTATCT
GCCATTGGTT TGGCGTCGGC ATCGAGTTTG GCTGCGAGTG ATTGGCAGGA CCACGGCTTA
GTAATTAATA CCACCTCATC TAGCGATTGG AATGCGATCG ACCCAGATTT AGTGGTCGAT
GAGCATGGCA ACCCTTGGTT AACAATGGGA AGTTGGAACA GCGGTATTAA AGTGATGCGC
TTGAACCCCA TTACCATGAA GCCAATTGGC ACACTTTATT CTATTGCGCA AAAGGGCGGC
GGTATTGAAG CGCCTTCTAT TGTGTATCGC CGTGGGTATT ACTATTTATT TGTTTCTATC
GGCAAATGCT GTGCGGGCGT AGATAGCACC TATCAAATTG CTTACGGGCG CTCTACAAGT
ATTACCGGCC CTTATTTGGA TAAGAACGGC AACGATATGA TGAGTGGTGG TGGCAGTATT
TTAGATGCGG GCAACAACGT GTGGGTTGGC CCTGGTGGGC AAGATATTAT TAACACCGAT
GTCATTGTGC GCCACGCGTA CGATGCCACA GATGCAGGCA CACCTAAGAT GATTATTAGT
ACCTTGAATT GGGATGCTAA TGGATGGCCG AAATACTAG
 
Protein sequence
MNLCRKLYQL ATTVVLLCMC AFANAQLSNG VYSITSKLSG KPIEITGAST AAGANVIQWA 
NNGGDHQKWI VTHEGNGDYS IINLLSGMAL EVFDFSTADG GNVVQYDFWH GDPQLWTLSS
QGNGYYAVLN KHSGKALDLY GFDTSNGANI AQWAFWGGDP QQWQFTKIAN VGAPPVDTST
TNGATNHWSL TGNLVTHDPT MAYENGSWWL YQTGEGIYGK YSANGLAWDG LPSVFPNGLS
WWKTYVPGQS NNDVWAPDVR TYNGRVYLYY SISTFGSRVS AIGLASASSL AASDWQDHGL
VINTTSSSDW NAIDPDLVVD EHGNPWLTMG SWNSGIKVMR LNPITMKPIG TLYSIAQKGG
GIEAPSIVYR RGYYYLFVSI GKCCAGVDST YQIAYGRSTS ITGPYLDKNG NDMMSGGGSI
LDAGNNVWVG PGGQDIINTD VIVRHAYDAT DAGTPKMIIS TLNWDANGWP KY