Gene Sde_0544 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_0544 
Symbol 
ID3967887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp663933 
End bp665582 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content50% 
IMG OID637919607 
Productputative allophanate hydrolase 
Protein accessionYP_526020 
Protein GI90020193 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1984] Allophanate hydrolase subunit 2
[COG2049] Allophanate hydrolase subunit 1 
TIGRFAM ID[TIGR00370] conserved hypothetical protein TIGR00370
[TIGR00724] biotin-dependent carboxylase uncharacterized domain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00179093 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCACA TACAAACATT GCAGCGTATT AACGACCACA GCCTACTTGC TGCCTTTAGC 
CAAGGTTTTA GCCAAGCCTT AAGCTACGGC CAAATAAGTC AGCTTGCCGG CTGGTTAAGA
CACAATATTA CCAGCATAAA AGAAGCGGTA CCGGCCTACA GCCAAGTATT CATTGAGTGG
GATGACACGC TAACCTATAT CGACATCAGC CAAGCGCTCA ACGCAGCGCT AAAAAATATA
CAAACGATTA CGCAGCCCCA TGCCACGGCA GAAGCCCCTC AGCCCTTAAC AGTACAAGTG
TGCTACCACC CAAGCGTAGC CCCCGACCTA TATAACGTGG CCACAACACT TGGCTTGAGC
GAACAAGACG TTATTCGCCA GCACCTAAAA GCAAACTACC AAATACAAGC CATTGGCTTT
ATGCCTGGCT TCGCCTACTT AGGTGGCCTG CCTGACAATT TGCGTATACC GCGCCGCGCC
AACCCTCGTA CGCAGGTGCC TGCCGGCAGT GTAGCCATAG CCGAAAACCA AACGGCGGTA
TACCCCTTAG CTGCCCCAGG TGGTTGGCAC TTAATTGGCT GTAGCCCAGC CCCGCTAGGG
GACCCAAACA TATTAGAAAG TGCACACAAA CAAAAAGAAA ATAACACCCT AAGCGTAGGC
AAAGCCGTTA AATTTGAGCA GATTAGTCTC GAGCAATTTA ATCGCATTGC GCAGCAGCGT
GAACAATCAA AACCTCAAAA CCATTTACCA TCACAAAGCA CATTTACCAC ACGCAACCCA
ACAAGCCACA TGACTATTTT GCAAACTGGG CCATTGGCCC TACTGCAAGA TACTGGACGC
AAGCAGGTGC AGCACTTAGG CGTAAGCCAG TGCGGCAGCT TAGACAACTA CGCCTACCAA
TGGGCCAATA AGTTATTAGG AAATGCCCCA AATAGCCCCG CCATAGAGAT AACGCTTGGC
CTGTTTAAAG CGCGCTTTAA CGCTCCTACT ACCATTGCCA TTGCAGGGGC AGATTGCCAC
GCAACGCTTA ACAACCGGCC ACTGCACAAC TGGGCCAGCC ACCGTGTAAA TACAGGTGAC
ACGCTCGCCT TTGGCGGCGC ACGCAGTGGT GCTCGCGCAT ATTTAGCGGT TGCAGGTGGG
TTTGAGTGTG CAACTTTGTT TGATAGCTGC GCAATGAACC CCAAAGAACA TTGGCCGCAA
ACAGCAAGCC AGTTAAGCGC CAAACAGAAT GTTGGCTATA GAAGTAACAG CCAGCCACCA
CTAAAAATGG CACCGTGGTA CAAACAGCCC AATTACCAAC AAGAGCTTGT ATTGAGGGTT
TACCCGAGCT ATCAATTTGA TCAGTTCGGC GCAAATGAAA TAGACAAGCT GTGCAGTGAG
AACTACCAGA TTCACTCCCA CAGCGACCGC ATGGGCTACC GCTTAGAAGG CGCAAATATT
CACTGGCAGC ACGGCGGTAT AAGCTCTGAG CCTATTGCCT TTGGCAGCGT GCAAATTCCA
CCAAGCGGGC AACCTATAGT GCTGCTTAAC GACAGGCAAA CCCTTGGGGG CTACCCTAAA
ATTGGTTGCG TAAAAGCGGA AGACTGCTGG CAATTAGCGC AGCGACAAGC AGGGCAAAGT
GTGCGATTTG AGTTTATAGA GTGGGATTAA
 
Protein sequence
MTHIQTLQRI NDHSLLAAFS QGFSQALSYG QISQLAGWLR HNITSIKEAV PAYSQVFIEW 
DDTLTYIDIS QALNAALKNI QTITQPHATA EAPQPLTVQV CYHPSVAPDL YNVATTLGLS
EQDVIRQHLK ANYQIQAIGF MPGFAYLGGL PDNLRIPRRA NPRTQVPAGS VAIAENQTAV
YPLAAPGGWH LIGCSPAPLG DPNILESAHK QKENNTLSVG KAVKFEQISL EQFNRIAQQR
EQSKPQNHLP SQSTFTTRNP TSHMTILQTG PLALLQDTGR KQVQHLGVSQ CGSLDNYAYQ
WANKLLGNAP NSPAIEITLG LFKARFNAPT TIAIAGADCH ATLNNRPLHN WASHRVNTGD
TLAFGGARSG ARAYLAVAGG FECATLFDSC AMNPKEHWPQ TASQLSAKQN VGYRSNSQPP
LKMAPWYKQP NYQQELVLRV YPSYQFDQFG ANEIDKLCSE NYQIHSHSDR MGYRLEGANI
HWQHGGISSE PIAFGSVQIP PSGQPIVLLN DRQTLGGYPK IGCVKAEDCW QLAQRQAGQS
VRFEFIEWD