Gene Dred_1336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDred_1336 
Symbol 
ID4957300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum reducens MI-1 
KingdomBacteria 
Replicon accessionNC_009253 
Strand
Start bp1432544 
End bp1433554 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content45% 
IMG OID640180515 
ProductL-asparaginase II 
Protein accessionYP_001112695 
Protein GI134299199 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4448] L-asparaginase II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.156479 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTCAGAAC TACTTGTCAA CCTCGTTCGG GGTAGTCTTA TTGAATCCCA ACACCGTGGG 
GATCTGGTTG TTGCCGATCG AGAGGGTGCT ATTTTACTTA GTTTGGGTAA TCCTGAACAC
CTTGCCTATT GGCGCTCGTC AGCAAAACCC TTTCAAGCTC TGCCTTTAAT CGAACATCAT
TGTCAAGATA TTTTTAACTT TACTGAACAA GAAATAGCTC TTTTTACTTC ATCCCACGGC
GGAGAAGAAA ACCATGTGGA GGCAATTCGT GGAATCCTAC ATAAATTAGG GTTATCAGAT
TCAGCTCTGG ATTGTGGAGT CTCTGCCCCT ATGCACAGGC CCAGTGCAAA AAACATTCTT
GCATCAGGTA ATAATTTTAG TACCTTAAAT AATGCCTGTT CCGGCAAACA TACCGGTATG
CTGGCCCTAG CACTGCTCTT AAATGCCCCT TTAACCGGCT ATATACAAAA GGACCATCCC
GTTCAGCAGG AAATGCTAAA GGCCATTTGC CAATGTACAT CCTTATCCCC GGATAGGGTT
CATATGGGTG TTGATGGCTG TGGGGTCCCG GTTTTTGGAT TACCCTTAGG CAACATGGCT
ATGGCCTATG CCCGCTTATC CCTGCCCGAA GGATATTTCT CAGCGGAAAG GGTGCAAGCC
TTGCATACCA TTCGTAATGC CATGACCGAG TATCCCTATT ATGTGGCAGG CACTGACCGC
CTAGATAGTA TTCTGATGGA AGTAACTCAG GGTAGAATAG TTGCCAAGAT AGGTTCCGAA
GGAATTTACT GTGCAGGAAT TGTAGATCAT GGCATCGGGC TGGCTTTAAA AATTGAAGAT
GGTAGTTCCC GGGCCATCGA CCCCGTGATC ATTGAGGTTT TAAAACACCT GGGGTATATC
TCACAAGCTG AATTTGAGAA ACTGCGTCAT CTCTGGCGCC CCATACTAAA AAATCACCGT
GGTGATGAGA TTGGTCATTT AGAAGTTGCC TTTAATTTCA AAAATAAATA G
 
Protein sequence
MSELLVNLVR GSLIESQHRG DLVVADREGA ILLSLGNPEH LAYWRSSAKP FQALPLIEHH 
CQDIFNFTEQ EIALFTSSHG GEENHVEAIR GILHKLGLSD SALDCGVSAP MHRPSAKNIL
ASGNNFSTLN NACSGKHTGM LALALLLNAP LTGYIQKDHP VQQEMLKAIC QCTSLSPDRV
HMGVDGCGVP VFGLPLGNMA MAYARLSLPE GYFSAERVQA LHTIRNAMTE YPYYVAGTDR
LDSILMEVTQ GRIVAKIGSE GIYCAGIVDH GIGLALKIED GSSRAIDPVI IEVLKHLGYI
SQAEFEKLRH LWRPILKNHR GDEIGHLEVA FNFKNK