Gene Sde_2649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_2649 
Symbol 
ID3968507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp3350602 
End bp3352212 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content47% 
IMG OID637921747 
Productglutamate synthase, NADH/NADPH, small subunit 2 
Protein accessionYP_528121 
Protein GI90022294 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000123588 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0278702 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGTAT CATCGCGTAC CGCATTTCAG TTTGCCGCCA TTGTTGCAAT GGGTGGGTTT 
ATCTTTGGGC TAGATGCCGC GCTTATCTCC GGCACGGTGC GCTTTGTTAC TGCCGAGTTT
GGGTTGAATG ATTTGCAGTT GGGTTCAGTG GTTAGCGCAC CGGGTTTCGG TGTGTTGTTT
GCGCTAATTG CTGCGGGGCC TTTGTGCGAT CGAATTGGTA GAAAGTACAC TCTTATTATT
ATTGCTGCGC TCTATGTTTT ATCCGCGGTG TTTTCTGTAA TCGCCCCCAG TTACGAGGCA
CTTGTAGCGG CGCGCTTTAT TGGTGGTTTG GCCTTCGCCT CTCTCTCTTT GGCTTCTATG
TATATTGGAG AAATCGCACC AGCGGACATG CGTGGCAAGT TAGTTTCTAT GAACCAAATT
ACTATTGTGG TTGGGTTAAC GGCGGCTTAT TTCTTTAATT ATTTTTTATT GGAGCTTGCG
GGGTCGGGTG CCCAGTGGGT GCAAACCGTT GGCCTTCAGC AGAACCTTTG GCGCTGGATG
CTGGGCGTAG AAATTTTACC TGCACTTGTG TGGTTGTTGC TGCTGTTGCG CATTCCAGAA
AGCCCGCGTT GGTTGATAGC TAAAAATAGA ATTGACGATG CGAAGCACGC TTTGCAAAGG
TTGGTGCCGG CCGAGAAGGT GGAAGAAAAT TTAGCTTCTA TTTGCAATGC CGTGCATGGT
GAAGTGTTGG CCCCGTCTTT TGCTAAACAA TTTTCTATGT TGTTTGATTC TCGCTTAAAG
CGCGCAATGA TAGTTGGTTT GGCGTTTGCC ATAGTGCAGC CCATTACCGG TGTGAATGCA
ATTTTGTTCT ATGCGCCTAT GGTGTTTGAG CAGACTGGTG TAGGAACTAA TGCCGCATTT
ATGCAAACAA TTATCGTTGG GCTTGTGAGC TTGGTGTTTA CGGTGGTGGC ATTATCGTGC
ATAGATAAAT TTGGCCGCAG GCCGCTGGTT ATTGTTGGTT TGTTGTGGTC GGTGCTCAGT
TTATTTATGT GTTTCTGGGC GTTTAACGAG GCTACCTATT TGTTGGAGGC TTCACACCTT
GCTGAGTTGG CGGGCACTTT AGATCAGGCG CAATTGGGTG CCCTGCAAAA TATGATAGGG
GTGCAATATA GCAGCGACCT AGAATACAAA GCTGCATTGG CGCTTAATTT GGGGGCGGAA
TCTGCACGCG CTTACGAAGG GGTGCTTATT CAATTGGCGG CAACCATGAA TGGCTGGTTG
GTAATTCTAG GTATTATTGG TTTTATTGCT GCCTTTCACC TTTCTATTGG GCCTATTATG
TGGGTAGTGT TTTCCGAAAT TGTTCCCATT CATGTGCGTG GTGTGGCTAT CCCCATGTTT
GCATTTGTAA CCAGTTTGGT GAGTTTCTTT GTGCAAAAAC TGTTTCCATG GCAGTTAAAT
GTTATGGGCG CGGCAGAAAT ATTTTTATTT TACTGTTTGT CTGGTGCGGC CGGTTTGGTG
TTGTTGTGGT GGTTTTTGCC AGAAACTAAG GGCAAAACAA TCGAGCAAAT TGCCGATGGT
TTAGCGGGAG AGCAGCCCGC TTCATCTCGC GTGTCGACTT CTGCAAGCTA G
 
Protein sequence
MGVSSRTAFQ FAAIVAMGGF IFGLDAALIS GTVRFVTAEF GLNDLQLGSV VSAPGFGVLF 
ALIAAGPLCD RIGRKYTLII IAALYVLSAV FSVIAPSYEA LVAARFIGGL AFASLSLASM
YIGEIAPADM RGKLVSMNQI TIVVGLTAAY FFNYFLLELA GSGAQWVQTV GLQQNLWRWM
LGVEILPALV WLLLLLRIPE SPRWLIAKNR IDDAKHALQR LVPAEKVEEN LASICNAVHG
EVLAPSFAKQ FSMLFDSRLK RAMIVGLAFA IVQPITGVNA ILFYAPMVFE QTGVGTNAAF
MQTIIVGLVS LVFTVVALSC IDKFGRRPLV IVGLLWSVLS LFMCFWAFNE ATYLLEASHL
AELAGTLDQA QLGALQNMIG VQYSSDLEYK AALALNLGAE SARAYEGVLI QLAATMNGWL
VILGIIGFIA AFHLSIGPIM WVVFSEIVPI HVRGVAIPMF AFVTSLVSFF VQKLFPWQLN
VMGAAEIFLF YCLSGAAGLV LLWWFLPETK GKTIEQIADG LAGEQPASSR VSTSAS