Gene Sde_2428 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_2428 
Symbol 
ID3966648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp3073689 
End bp3074768 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content51% 
IMG OID637921519 
Productdihydroorotase 
Protein accessionYP_527900 
Protein GI90022073 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0418] Dihydroorotase 
TIGRFAM ID[TIGR00856] dihydroorotase, homodimeric type 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCACAG GCTTGAATGC CAATAAGCAG CAAGAACTCA TGTCTGAAAA AATTACTCTT 
ACCGCCCCCG ATGACTGGCA CATTCACCTG CGCGACGGCG ATGCCCTCGC ATACACCGTA
TCCGATGCCG CCCGCAACTT TCGCCGCGCC ATTATTATGC CTAACCTTGT GCCCCCCGTA
CTAAACGCCA AGCAGGCACT AGATTACAAA GCACGCATTC TTGCCCACGC ACCGGAAGAC
GCCGACTTTA CCCCGCTAAT GGTGTTATAC CTTACCGAAA AAACCAGCCC CGCAGACATA
GCAGAGGCAG CGGCCAAAGG CATAGTGGCC TGTAAGCTCT ACCCCGCCGG CGCCACAACC
AATTCAGACT CAGGCGTTAC CGACATAAAA AACTGTTACG ACGCCCTTGC TGCCATGCAA
GAGCACAATA TTAAGCTTTT GGTACACGGC GAAGTTACCG ATGCAGATAT CGATATTTTT
GACCGCGAAG CAACCTTTCT AAGCCGCACA ATGGAGCAGC TTGTTAAGGA CTTCCCAACG
CTAAAAATCG TTTTAGAGCA CATTACTACC GAAGATGCGG TCAAGTTTGT ACTTAAATCA
GGCCCCAACG TAGCGGCAAC CATTACTGCG CACCACCTTT TGTACAACCG CAACCACATG
CTCGCAGGCG GTATACGCCC ACACTATTAC TGTTTGCCTA TTTTAAAGCG CAGCTCGCAC
CAGCAAGCTC TAATTAGTGC TGCAATCAGT GGTAACCCAA AGTTCTTTTT AGGCACCGAC
TCTGCCCCCC ACGCAAAAAG TAAAAAAGAA GCTGCTTGTG GCTGTGCAGG TAGCTACACC
GCTTTTGCCG CGCTACCACT TTATGCTGAG GCATTCGAAG AAGCCGGCGC ACTCGACAAA
CTAGAAGATT TTGCCAGCCA TTTTGGCCCC GACTTTTACG GCCTACCGCG CAACACAACA
AAAGTGACCC TCACCAAGCA AGCATGGACA GTACCGTCCA ACCTGCCCTT CGGAAGTGAT
ACGCTAGTGC CAGTTAAAGC TGGCGAAACG TTGAATTGGA CTCTGACACC AGCGGAATAA
 
Protein sequence
MATGLNANKQ QELMSEKITL TAPDDWHIHL RDGDALAYTV SDAARNFRRA IIMPNLVPPV 
LNAKQALDYK ARILAHAPED ADFTPLMVLY LTEKTSPADI AEAAAKGIVA CKLYPAGATT
NSDSGVTDIK NCYDALAAMQ EHNIKLLVHG EVTDADIDIF DREATFLSRT MEQLVKDFPT
LKIVLEHITT EDAVKFVLKS GPNVAATITA HHLLYNRNHM LAGGIRPHYY CLPILKRSSH
QQALISAAIS GNPKFFLGTD SAPHAKSKKE AACGCAGSYT AFAALPLYAE AFEEAGALDK
LEDFASHFGP DFYGLPRNTT KVTLTKQAWT VPSNLPFGSD TLVPVKAGET LNWTLTPAE