Gene Jann_3747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_3747 
Symbol 
ID3936227 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp3828741 
End bp3830177 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content62% 
IMG OID637906125 
Productaldehyde dehydrogenase 
Protein accessionYP_511689 
Protein GI89056238 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAAC ACATGTCCCT GATTGACGGC CAATGGGCCG ACTTCGGCAA CTTCCAGCCC 
GATATCAACC CGGCCAACAC CGAAGAGGTG ATTGGCGAAT TCTCCTACGG CGACGCGAGC
GCAGTAGATG CGGCTGTGGA AGCCGCCACG CGGGCCGCAC CGGAATGGGC CGCGACGACC
CCGCAGGTGC GCCATGACCT GCTCGCGGCC GTTGGCAACG CGCTGGGTGC GAATGCGGCT
GACTATGGCC GTATATTGGC ACGAGAAGAG GGCAAGACTT TGCCGGAAGC CATCGGCGAA
GTGGGCCGCG CATCCCAGGT ATTCAAGTTT TTCGCAGGCG AAGCACTGCG CATCGGCGGG
CAACTCCACC CCTCGGTCCG ACCCGATATC GACATTGAGG TCACGCGGGA ACCGGTGGGG
GTTGTGGGCC TGATCACGCC ATGGAACTTT CCCATCGCGA TCCCCGCTTG GAAGGCCGCG
CCCGCACTGG CTTACGGCAA TTGCGTCGTG ATGAAGCCCG CCGATCTGAC GCCTGCCTGC
GCCCAGATCA TCGCCAAGCT GTTGCACGAG AATGGCTGTC CGCCGGGTGT GTTCAACCTC
GTCTACGGGC GCGGGTCGGT CGTGGGTGAA GCGCTGGTCA ACCACCCCGA CGTGGGCGCG
ATTTCCTTCA CCGGATCGGC GGAGGTCGGC AACCGCATGG CCGAGATCTG CGGCAAACTC
CGCAAGAAAA TCCAGCTGGA GATGGGTGGA AAGAACCCCA TGGTCGTCAT GGACGACGCG
GATATGGACG TCGCCGTGGG CGCCTGCCTG AACGGCGCGT TCTTCTCCAC CGGGCAACGC
TGCACCGCGT CGTCCCGGCT GATCGTGCAA GCGGGTATTC ATGATGCGTT TGTCGAAAAG
CTTCGCGACG CGATGGTGGC CCTGAAGGTC GGTGATCCAC TCGATGAGAC GACGCAGATC
GGGCCCGTAA TCGATGAAAC GCAGCTCAAG TCGAACCTGG ACTATGTGGA CCTCGCCAAA
CAGGAGGCGT CGGACGTTGT CGGCGGCGAA CGGTTGGAGC TGTCGACCCC CGGCTTTTTT
CAGGCCCCCG CCCTGTTCCT CGGCACCACC AATGACATGC GCATCAACCG GGAGGAAATC
TTCGGCCCGA CGGCTTCCAT CATCAAGGTC GGTGACTTCG AAGAGGCCGT CGCCATCGCC
AATGACACCG AATTCGGGCT CTCGTCCGGC ATCTGCACTA CCAGCCTGAA ACATGCGCGC
GAATACCGTC GACGAGCGCA GGCGGGTATG TTGATGGTCA ACCTCCCGAC AGCAGGCGTG
GATTACCACG TGCCCTTCGG CGGGCGTAAA GGCTCAAGCT TCGGGCCGCG AGAACAGGGC
AGCTTCGCTG CCGAATTCTA CACCATCGTC AAGACATCCT ACGTTTTCTC GGGGTAG
 
Protein sequence
MTKHMSLIDG QWADFGNFQP DINPANTEEV IGEFSYGDAS AVDAAVEAAT RAAPEWAATT 
PQVRHDLLAA VGNALGANAA DYGRILAREE GKTLPEAIGE VGRASQVFKF FAGEALRIGG
QLHPSVRPDI DIEVTREPVG VVGLITPWNF PIAIPAWKAA PALAYGNCVV MKPADLTPAC
AQIIAKLLHE NGCPPGVFNL VYGRGSVVGE ALVNHPDVGA ISFTGSAEVG NRMAEICGKL
RKKIQLEMGG KNPMVVMDDA DMDVAVGACL NGAFFSTGQR CTASSRLIVQ AGIHDAFVEK
LRDAMVALKV GDPLDETTQI GPVIDETQLK SNLDYVDLAK QEASDVVGGE RLELSTPGFF
QAPALFLGTT NDMRINREEI FGPTASIIKV GDFEEAVAIA NDTEFGLSSG ICTTSLKHAR
EYRRRAQAGM LMVNLPTAGV DYHVPFGGRK GSSFGPREQG SFAAEFYTIV KTSYVFSG