Gene Sde_1980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_1980 
Symbol 
ID3967223 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp2490601 
End bp2491809 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content43% 
IMG OID637921068 
Producthypothetical protein 
Protein accessionYP_527452 
Protein GI90021625 
COG category[R] General function prediction only 
COG ID[COG0523] Putative GTPases (G3E family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0097799 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCAAA AACTACCTGT TACCGTTTTA TCTGGCTTTC TTGGCGCAGG TAAAACTACC 
GTGCTAAACC ATATTTTAAA TAATCGCGAC CATTTACGTG TTGCCGTAAT TGTTAATGAT
ATGAGTGAAG TAAACATAGA TGCAGCAACA GTTAAAAATG AAGTAACACT CAACCGCAGC
CAAGAGAAAT TGGTCGAAAT GAGTAATGGC TGTATTTGCT GCACATTGCG AGAAGACTTG
CTAATTGAGG TAAACAAACT TGCGAAAGAA GGGCGATTTG ATTACCTAGT AATTGAATCC
ACCGGTATTT CTGAGCCGTT GCCTATCGCT GAAACGTTCA CATTCGCCGA TGAAACAGGT
GTAAGCCTTT CCGACGTAGC AAGGCTAGAC ACCATGGTAA CTGTTGTGGA TGCCGCTAAC
TTTCTTAATG ATTATGACGA AGCTAAGTAC CTACAGGAAA CCAGTGAAAG CCTAGGCGAT
GACGATGAGC GCACGGTTGC AGACTTGCTA GTAGATCAAA TTGAATTTGC CGACGTAATA
CTCATTTCTA AAAGCGACGT AGTAAGCAAT AAACACTTAG CCCGCACGCA AGCTGTACTA
CAAACACTTA ACCCCGAAGC AAGTATCCAT ACAATTGCAA ACGGTAAAGT AAACGTAAAG
ACTGTATTAG CTACAGGAAA ATTCAGCTTT GACAAAGCGC AGCAATCAGC GGGCTGGCTA
AAAGAAATGC GCGGAGAGCA TATTCCAGAA ACCCAAGAGT ACGGTATTAG CAGCTTTGTT
TATCAAGCCC GTAAGCCATT TCACCCGCAA AAATTTTATA ATTTTTTGCA CAGCGAACAA
CTTGCAGGGA AACTCCTGCG TTCAAAAGGT TATTTTTGGC TGGCGACTCG GCCAGAAGCC
GCTGGGCAGT GGAATCAAGC CGGTGGTATT GCACGGTATG GTTTTGCTGG CATGTTTTGG
AAAGCAGTAC CCAAAGAAAA TTGGCCCGAT GACGAAGACT ACCTTGCATC TATAAAAAAG
AGTTGGGAAG AGCCATTTGG AGATATGCGC CAAGAGCTTG TGTTTATTGG TCAAGGCCTA
GACAAACAGG CTGTAATTGA GGCGCTAGAT AAATGTTTAT TAACGGAAAA AGAATTGCTT
GCAGGCAAGG ACTATTGGTT AGGTTTAGAT GATCCGTTTC CAGCTTGGAA CGACAAAGAA
GCCGCTTAG
 
Protein sequence
MNQKLPVTVL SGFLGAGKTT VLNHILNNRD HLRVAVIVND MSEVNIDAAT VKNEVTLNRS 
QEKLVEMSNG CICCTLREDL LIEVNKLAKE GRFDYLVIES TGISEPLPIA ETFTFADETG
VSLSDVARLD TMVTVVDAAN FLNDYDEAKY LQETSESLGD DDERTVADLL VDQIEFADVI
LISKSDVVSN KHLARTQAVL QTLNPEASIH TIANGKVNVK TVLATGKFSF DKAQQSAGWL
KEMRGEHIPE TQEYGISSFV YQARKPFHPQ KFYNFLHSEQ LAGKLLRSKG YFWLATRPEA
AGQWNQAGGI ARYGFAGMFW KAVPKENWPD DEDYLASIKK SWEEPFGDMR QELVFIGQGL
DKQAVIEALD KCLLTEKELL AGKDYWLGLD DPFPAWNDKE AA