Gene Tcur_4249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTcur_4249 
Symbol 
ID8605605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermomonospora curvata DSM 43183 
KingdomBacteria 
Replicon accessionNC_013510 
Strand
Start bp4850375 
End bp4851940 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content71% 
IMG OID 
ProductAldehyde Dehydrogenase 
Protein accessionYP_003301814 
Protein GI269128444 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACCC CTACGGCCTC AGCGCTGACG CTCGCCCCCG CCCTCATCGA GCGGTTGTCC 
TCTCACGTGA CCTCCGCCTC GGGCGGCACC ACCACCATTC CCGCCCCCTT CACCGGGGAG
CCGCTGGCCA CCGTGCCGGT GTCGAACGCC GACGACGTGC GCGCCGCCTA CGAGCGCGCC
CGCGAGGCCC AGCGGGCCTG GGAGCGGCTG CCGGTGCGCG AGCGGGTCAA GCCGTTCCTG
CGCATGGTGG ACGTCCTGGT CGACCGGCGC GAGGAGATCC TGGACGTCAT CCAGATGGAG
ACCGGGAAGG CCCGCCGGCA CGCCTATGAG GAGTTCCTCG ACGTGGCGCT GTGCACGCTC
TACTACGCGC GGCGGGCGCC CCGGCTGCTC AAGCCCCGGC GGCGGCAGGG CGCCTTCCCG
GTGGCCACCC GGACGGTGGA GCTGCGCAAG CCCAAAGGCG TGGTCGGGCT GATCTCGCCC
TGGAACTACC CCTTCAGCCT GGGCGCCAGC GACATCGCCC CGGCGCTGAT GGCCGGCAAC
GCGGTGATCC ACAAGCCCGA CACCCAGACC TGCCTGTCCA GCCTGTGGGT GCTGGACCTG
CTGATCAGCC TGGGGTTGCC GCGCGACCTG TGGCAGATCG TGGTGGGCGA CCCGGCCGAG
ATCGGCGACC CGCTGCTGGA GAACGCCGAC TACGTCGCCT TCACCGGCTC CACCCGCGGC
GGCCGGGCCA TCGCCGAGAA GGTCGCCCCC CGCCTGGTGG GCTACTCGCT GGAGCTGGGC
GGCAAGAACC CGATGATCGT CCTGGAGGAC GCCGACGTCG AGCGCACCGC CCGCGGCGCG
CTGCGCGCCT GCTTCACCAA CGCCGGGCAG CTGTGCATCT CCATCGAGCG GCTGTACGTC
CACGAGAAGA TCTACGACCG GTTCGTGCCC CGCTTCGTGG AGCAGGTCAA GGCGATGAAG
CTCGGCGCCG GGCTGGACTA CGAGGCCGAC ATGGGCTCGC TGACCTACCC GCGCCAGCTG
GAGGTCGTCA GCCGGCACGT CGAGCAGGCC CTCAAGGAGG GCGCCACGCT GCTGGCCGGC
GGCAAGGCCC GCCCCGACAT CGGCCCGCTG TTCTATGAGC CCACCGTGCT GACGAACGTC
ACCGGCGACA TGGAGCTGTG CGCCAACGAG ACCTTCGGCC CGGTGGTCAG CGTCTACAAG
TTCTCCGACG AAGACGAGGT CGTCCGCCTC GCCAACGACA CCGCCTACGG CCTGAACGCC
TCCATCTGGA CGCGGAATGT GGCCCGGGGC CGCCGCCTGG CCGCCCGCAT CAACGCCGGG
ACGGTGAACA TCAACGAAGG CTACGGCGCC GCGTTCGCCT CCTACGACGC CCCGATGGGC
GGCATGAAGC AGTCCGGCCT GGGCCGCCGG CACGGTGCCG AGGGCATCCT CAAGTACACC
GAGCCGCAGA CTGTTGCCAG CCAGCACCTG GTGGAGCTGG CCCCGCCGCC GTTCCTGGGC
TATGACCGCT ACGCCACCGG CATGGCGACC GCGATCAAGC TCATGAAGCG GCTGCGGATC
AGGTAG
 
Protein sequence
MATPTASALT LAPALIERLS SHVTSASGGT TTIPAPFTGE PLATVPVSNA DDVRAAYERA 
REAQRAWERL PVRERVKPFL RMVDVLVDRR EEILDVIQME TGKARRHAYE EFLDVALCTL
YYARRAPRLL KPRRRQGAFP VATRTVELRK PKGVVGLISP WNYPFSLGAS DIAPALMAGN
AVIHKPDTQT CLSSLWVLDL LISLGLPRDL WQIVVGDPAE IGDPLLENAD YVAFTGSTRG
GRAIAEKVAP RLVGYSLELG GKNPMIVLED ADVERTARGA LRACFTNAGQ LCISIERLYV
HEKIYDRFVP RFVEQVKAMK LGAGLDYEAD MGSLTYPRQL EVVSRHVEQA LKEGATLLAG
GKARPDIGPL FYEPTVLTNV TGDMELCANE TFGPVVSVYK FSDEDEVVRL ANDTAYGLNA
SIWTRNVARG RRLAARINAG TVNINEGYGA AFASYDAPMG GMKQSGLGRR HGAEGILKYT
EPQTVASQHL VELAPPPFLG YDRYATGMAT AIKLMKRLRI R