Gene Jann_3507 details


Gene Information

Locus tag: Jann_3507
Symbol:
ID: 3935981
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 3562415
End bp: 3563899
Gene Length: 1485 bp
Protein Length: 494 aa
Translation table: 11
GC content: 64%
IMG OID: 637905881
Product: aldehyde dehydrogenase
Protein accession: YP_511449
Protein GI: 89055998
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
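
The coordinate and length fields in this record are related by simple arithmetic, sketched below in Python. The function names are hypothetical, and the sketch assumes inclusive 1-based coordinates and a single terminal stop codon; neither convention is stated in the record itself.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming inclusive 1-based coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3562415, 3563899)
print(length, protein_length(length))  # 1485 494
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields.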


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.190202
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000753817
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGATCAAG CCCGCATCGA CGCCCTTCGC GGGCAGCCTG TTCCCCCGTT TGGGCATCTG 
ATTGATGGTA AGATTGTGCC TGCCTCGGAC GGGGCTGTGA TGGACGTGAC CTCCCCCCTG
AATGGGGAGG TACTGACAAC CAGCGCGCGT GGCACGGCGT TGGATATGGA GACGGCCATT
GCGTCGGCCC GCACAGCGTT TGAGGATAGG CGTTGGGCTG GCCAATCTCC CGCCGCCCGC
AAGAAAGTGC TGCACAAATG GGCTGATCTG ATCGAGGCCC ACGCGCTGGA ACTGGCTGTT
CTGGGCGTGC GCGACAATGG GACTGAAATT TCCATGGCCC TGAAGGCCGA ACCAGGGTCC
GCCGCAGCGA CGATCCGTTA CTATGCGGAG GCTTTGGATA AGGTTTACGG CGAAATCGCC
CCCACGGCGG GTGATGTCCT GGGCCTAATC CACAAGGAAC CCGTGGGCGT TGTCGGCGCG
ATCATCCCGT GGAATTTCCC ACTGATGATC GGCGCGTGGA AGCTGGGGCC TGCCCTGGCG
ATGGGCAATT CGGTGGTGTT GAAGCCCGCG GAAACGGCGT CCTTGTCGCT GATGCGCATG
GCGGAGCTGG CGTTGGAGGC CGGCCTGCCG CCGGGTGTTT TGAACGCAGT GACCGGGGAA
GGCGCAGTGG TGGGGGAGGT CATGGGCCTC TCGATGGATG TGGACGTGCT GGTGTTCACC
GGCTCCGGCG GGACGGGGCG GCGGTTGCTG GACTATGCCG CCCGGTCCAA CCTCAAACGG
GTGTATCTGG AGTTGGGCGG AAAAAGCCCC AACGTCATCT TCGCCGACGC CCCAAATCTG
GAGGAGGCCG CAAAGGTCGC GGCGGGTGGT ATTTTTCGCA ACGCGGGGCA GGTCTGCGTG
GCGGGGTCCC GGTTGCTGGT GCAGGCATCT ATCCACGATG AATTCGTTGC CATGGTCGCG
CAGGTCACGG AACAGATGCA GGTGGGCGAC CCGCTCAACC TGTCCACGCA GATCGGCGCG
GTGAACTCGG AAGCGCAACT GGCCCAGAAC CTCGCGTTTG TGGATACGGC GCAGGCCGAA
GGCGGCACAG TCACGACGGG CGGCACGCGT GTGTTGCAAG AGACCGGCGG CAGCTTCATG
GCCCCCACGA TCGTGACGGG CGTCACCCCG GACGCGACCC TATCGCAGAA AGAGGTCTTC
GGCCCCATCC TCGCCGTGAC CCCGTTTGAG ACGGACGAGG ACGCCGTGCG CATCGCCAAT
TCCACCGTCT ACGGGCTGGC AGGCGCGGTC TGGACATCGA ACCTTGGCCG CGCCCATCGG
ATGGTTCGCG ATATCCGCGC CGGCGTGATG CATGTGAACA CCTATGGCGG CGCGGACGGC
ACGGTGCCCT TGGGCGGTGT TGGGCAATCG GGCAACGGGT CTGACAAGTC GCTCCATGCG
CTGGACAAAT ACATCAACCT CAAGACGGCG TGGATCAAAC TATGA
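
The gene sequence is laid out in space-separated blocks of 10 bases, so it needs cleaning before any computation. The following is a minimal Python sketch, with hypothetical helper names, that strips the layout whitespace and computes GC content. It is shown on the first row only; the whole-gene figure in the record (64%) would be obtained by passing all rows.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and newlines from the block layout and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGGATCAAG CCCGCATCGA CGCCCTTCGC GGGCAGCCTG TTCCCCCGTT TGGGCATCTG"
seq = clean_sequence(first_row)
print(len(seq), gc_percent(seq))  # 60 65.0
```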
 
Protein sequence
MDQARIDALR GQPVPPFGHL IDGKIVPASD GAVMDVTSPL NGEVLTTSAR GTALDMETAI 
ASARTAFEDR RWAGQSPAAR KKVLHKWADL IEAHALELAV LGVRDNGTEI SMALKAEPGS
AAATIRYYAE ALDKVYGEIA PTAGDVLGLI HKEPVGVVGA IIPWNFPLMI GAWKLGPALA
MGNSVVLKPA ETASLSLMRM AELALEAGLP PGVLNAVTGE GAVVGEVMGL SMDVDVLVFT
GSGGTGRRLL DYAARSNLKR VYLELGGKSP NVIFADAPNL EEAAKVAAGG IFRNAGQVCV
AGSRLLVQAS IHDEFVAMVA QVTEQMQVGD PLNLSTQIGA VNSEAQLAQN LAFVDTAQAE
GGTVTTGGTR VLQETGGSFM APTIVTGVTP DATLSQKEVF GPILAVTPFE TDEDAVRIAN
STVYGLAGAV WTSNLGRAHR MVRDIRAGVM HVNTYGGADG TVPLGGVGQS GNGSDKSLHA
LDKYINLKTA WIKL
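
As a consistency check between the two sequences, the first codons of the gene can be translated and compared with the start of the protein. The sketch below is a hand-written Python translator, not the database's own method. It builds the codon table from the NCBI TCAG ordering; the amino acid assignments of translation table 11 are the same as those of the standard code, and the function name is hypothetical.

```python
from itertools import product

# Amino acids in NCBI order: codons enumerated with bases T, C, A, G
# at positions 1, 2, 3 (position 3 varies fastest). '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a cleaned DNA string codon by codon, stopping at a stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, with layout spaces removed.
print(translate("ATGGATCAAGCCCGCATCGACGCCCTTCGC"))  # MDQARIDALR
```

The output matches the first ten residues of the protein sequence listed above.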