Gene Sde_1098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_1098 
Symbol 
ID3968262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp1408322 
End bp1409374 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content48% 
IMG OID637920166 
Productgalactose-1-phosphate uridylyltransferase 
Protein accessionYP_526572 
Protein GI90020745 
COG category[C] Energy production and conversion 
COG ID[COG1085] Galactose-1-phosphate uridylyltransferase 
TIGRFAM ID[TIGR00209] galactose-1-phosphate uridylyltransferase, family 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000020957 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.395809 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGTA ATTTAGCTTT TGACCCAAAC GATCATAGCC ATAGACGTAA AAACCCGCTA 
ACAGGCGAGT GGGTGCTCGT ATCTCCGCAT AGAAGTAAGC GCCCCTGGCA AGGCCAGCAA
GAGCCACCAT CAACAGATAC GCCCCCCTCC CATGACCCAG ACTGCTACCT GTGCGCTGGC
AATAAGCGCA TCACCGGTGA AGTTAACCCT CAATATAGCT CAACCTTTGT GTTTACCAAT
GACTTTGCAG CAGTACAAAA AGAATCGCAT AAGGCCCCCG AGTCGACCAA TAAAGGGCTT
TTTGTTACCC AACCTGTAAG CGGTACCGCC AGAGTTATAT GCTTTTCACC AGACCACAGC
AGAAGTCTAC CGCTTCTTTC TATAGAGGAG CTCAATGCTG TCGTATCTTG CTGGCAAGAG
CAGTTAACCG AATTGAAGGA AAGCTGTAGC TGGGTGCAAA TTTTTGAAAA CAAGGGGGCG
GCCATGGGGT GCTCCAACCC CCACCCGCAT GGGCAAATCT GGGCCACAGA TCAAATACCT
ACTAAAGCTG AGAAAGCTGA CAAAAACCTT AAGGCTTATT TCGATGAGCA TGGCTCAAAC
TTACTTCTGG ACTATGTAGA AGAAGAACTA GCCAGCGGAG AACGCATTGT CGACGCCAAC
GAAGACTGGG TCGCACTTGT CCCCTACTGG GCATGCTGGC CGTTCGAAAC ACTACTGCTT
CCAAGAAAAC ACATTAAGCA TTTAGATGTA TTAACCGAGC AGCAAAAAGA AAACCTATCA
AAAATTTTAA AAGCAACGCT TAGCCGCTAT GACAATCTTT TCAACTGCAG CTTCCCCTAC
TCCATGGGAT GGCACGGCCA ACCTTTCGAC GGAAAAGATA ATACCCACTG GCAGGTGCAC
GCACACTTTT ACCCGCCTTT ATTACGCAGC GCCTCAGTTA AGAAATTTAT GGTTGGCTAC
GAAATGCTAG CAGAAGCCCA AAGAGATATA ACACCAGAAC AAGCCGCGGC GCGCTTGCGC
GAATGCTCCA CCACCCACTA TTTAGCGAAG TAA
 
Protein sequence
MSSNLAFDPN DHSHRRKNPL TGEWVLVSPH RSKRPWQGQQ EPPSTDTPPS HDPDCYLCAG 
NKRITGEVNP QYSSTFVFTN DFAAVQKESH KAPESTNKGL FVTQPVSGTA RVICFSPDHS
RSLPLLSIEE LNAVVSCWQE QLTELKESCS WVQIFENKGA AMGCSNPHPH GQIWATDQIP
TKAEKADKNL KAYFDEHGSN LLLDYVEEEL ASGERIVDAN EDWVALVPYW ACWPFETLLL
PRKHIKHLDV LTEQQKENLS KILKATLSRY DNLFNCSFPY SMGWHGQPFD GKDNTHWQVH
AHFYPPLLRS ASVKKFMVGY EMLAEAQRDI TPEQAAARLR ECSTTHYLAK