Gene Sde_0995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_0995 
Symbol 
ID3965120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp1276856 
End bp1278271 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content51% 
IMG OID637920062 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_526469 
Protein GI90020642 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000812271 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGTATTC GCAGCAGCCT ATTGTCCTCC GGTCTGTTAA AATGCACGCC TTTTTTGGGG 
GATGATCGTT TGGCAAGTGA AGTTCGGGCT GTAAGCTCGC TAGCATTGTT GTATGCGTTT
AGGATGTTGG GCCTATTTAT GGTGCTGCCT ATTCTCGTGC TTTATGCCGG CGACTACCCA
GGTGCAACGC CTTTTACTTT GGGGTTGGCA CTTGGTATTT ATGGTTTAAC TCAAGCTGTC
TTTCAAATTC CCCTTGGTTT GCTCTCAGAC TTTATTGGCC GCAAACCCGT TATTATTGCA
GGCTTGTTGG TGTTTTGTGC CGGCAGTGTG CTTGCAGGTA CGGCCGAATC GGTAGAGTGG
TTAATTATTG GTAGAGCCCT ACAAGGCAGT GGCGCCATTG CAAGTACCAT TATGGCCATG
GTGGCCGACC TTACCTCTGA GCAAAACCGC ACCAAAGCCA TGGCCGCTAT TGGCGCTTCT
ATTGGGTTGT CGTTTTCGCT GGCGATGATT TTAGGGCCTA CGGTGGGCGC GTTTGGTGGC
TTATCGGTGG TGTTTTATTT TTCTGCGGTA CTGGCGCTTA TTGGTGTGTG CATTGTGATC
TTTTTAGTGC CTCGCCCGCC GCAAGTAGGG CACTCTCACC GCGACAGTGG CGCGGTGCCA
GAGCTTATTA TGCAAACCCT TAAAAACACC GAGCTGCTGC GTTTAAATTT CGGTATTTTT
ACTTTGCACG CGTTGCTAAT GGCCTGTTTC TTGGCAATTC CTGTTGTAGT GGAAAGTAGT
TTAGGTATTC CTCGGGGTAA GCATTGGCAG GTTTACTTGC CAATGTTGGC TATCGCGTTG
GGCGTAGTGC TGCCGCTCAT TATGGTTGCT GAGCGCAATC GCAAGTTAAA GCCCGTTTTC
TTGTTTGCCA TTGCTGTGCT TGTTGTTTCG CAAGCTAGCT TAGCGGTTGT TCCGTTGGCG
GGCTGGCCGT TTTTATTGCT AATGCTGCTG TTTTTCGTGG CGTTTAACCT GCTGGAGGCT
TGTTTGCCTT CGTTGGTCAG TAAGCTTGCA CCTGTAGGTG CCAAAGGCAC GGCGATGGGG
GTGTATTCCA CTAGCCAGTT TTTAGGGGCG TTTATTGGCG GGTCTGTTGG AGGTTACATT
TTTACCCTGT GGGGTATGGA TGGCCTGTTC GCTGCTGGCG CGCTTTGCGC TGCCGCTTGG
TTCGCAGTAG CTGCATCTAT GCGCACTCCG CGCCACTTAA GCAGCATGTG TATGGGCGTT
CAGCGTGAAA GCGGCGCACA GTGTGCGATT GATGTACTCA CATTGCCGGG CGTAGTAGAG
GCTCTGTGGG TCGAGCAGGA AGGTTTGTTG TATTTAAAAG TAGATAATCG AGAGCTGGAT
AGGTCCCAGT TGGATGATTT GATTGCGGCG CAATAA
 
Protein sequence
MGIRSSLLSS GLLKCTPFLG DDRLASEVRA VSSLALLYAF RMLGLFMVLP ILVLYAGDYP 
GATPFTLGLA LGIYGLTQAV FQIPLGLLSD FIGRKPVIIA GLLVFCAGSV LAGTAESVEW
LIIGRALQGS GAIASTIMAM VADLTSEQNR TKAMAAIGAS IGLSFSLAMI LGPTVGAFGG
LSVVFYFSAV LALIGVCIVI FLVPRPPQVG HSHRDSGAVP ELIMQTLKNT ELLRLNFGIF
TLHALLMACF LAIPVVVESS LGIPRGKHWQ VYLPMLAIAL GVVLPLIMVA ERNRKLKPVF
LFAIAVLVVS QASLAVVPLA GWPFLLLMLL FFVAFNLLEA CLPSLVSKLA PVGAKGTAMG
VYSTSQFLGA FIGGSVGGYI FTLWGMDGLF AAGALCAAAW FAVAASMRTP RHLSSMCMGV
QRESGAQCAI DVLTLPGVVE ALWVEQEGLL YLKVDNRELD RSQLDDLIAA Q