Gene Jann_3789 details

Gene Information

Locus tag: Jann_3789
Symbol:
ID: 3936269
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 3873017
End bp: 3874564
Gene Length: 1548 bp
Protein Length: 515 aa
Translation table: 11
GC content: 64%
IMG OID: 637906167
Product: aldehyde dehydrogenase
Protein accession: YP_511731
Protein GI: 89056280
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
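
The length fields above agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and if the final codon of the CDS is a stop codon that is not counted in the protein length. Both readings are assumptions; the record does not state them. The function names below are illustrative only. A minimal check under those assumptions:

```python
# Values copied from the record above.
START_BP = 3873017
END_BP = 3874564


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(gene_length(START_BP, END_BP))                  # 1548
print(protein_length(gene_length(START_BP, END_BP)))  # 515
```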


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.822208
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGACC GGAGTGAGAA AACGCGGCGA GGGGAGGATA CCGTGGATAA CGCCATGGAG 
ACGGACGCAG AAGATGTGCC CGTCGAAACG CATCTTTTTG TGGACGGAGA GGCGCGCCCG
GCATCAGCGG GACGCCTCTA TCCCATCTAC AATCCGGCAC GACCCGATGA ATTGGTTGGC
CACGCGGCAG CAGCAGATGC CGACGATGTC GATGCCGCCG TGCGGGCGGC CGACGCAGCG
TTTCCGGCCT GGTCTTCGCG GACCTACACA GAGCGCGCAG AGCTGCTGAT CGCCATTGCC
GATGCCCTGA GTTCCGACGA TGCAGACGTC GCCCGCAGGT CCCGCCTGTT CTGCCGTGAG
CATGGCAAGA TCTTGCGCGA AACGCATCTG GAACTGAGCC GCCTTGGCGA CCGGTTCCGG
CTGAGCGCGT CCTACGCAGA ACGGCTGGCG GCGGACGAAA CCCTTCAAGG GCCACCCTTT
GATACGATCA TAACCCGCCA ACCGCGTGGG GTCGCCGCGC TGATCGTGCC ATGGAACTGG
CCCCTGTCGA TCCTTGGCGC GAAGCTGCCG CAGGCCCTGA TGGCGGGCAA TACCGTGGTT
GTGAAGCCAA GCCACAACTC CGCGCTGGCC CCGTCACAGA CGCTGCGGAT CATCGCAGAG
ATGCTGCCAC CCGGCGTGTT GAGTGTCGTG ACGGGCAGCG CGTCGGACAT CGGCGATCCG
CTGGTGCGCC ACCCGCTGGT CCGGTTCGTG AATTTCACCG GATCGGTTGA GGTCGGACGC
CACGTGATGC GGCAGGCCGC AGACAATCTG ACGCCCGTGA CACTGGAACT GGGCGGCAAT
GATGCCGCCT TGATCTGCGA GGACGCGGCG CTTGATGACG GTGCGTTCAT GCGAATGTAC
ATGGGCGCGT TCATGTCATC GGGGCAGATC TGCATGGCGC TGAAACGGCT CTACGTGCAT
CGCTCGCGTT TTGACGAGGT GGTGGATGGG CTGGAGGCCA CGTGCAACCG GATGGTCGTA
GGCGACGGCC TTTTGGACGG CACCAACATG GGCCCTGTGA ACAACGCGAA GCAACTGCAG
GTCGTGACCG ACATGATCAA CGAAGCCCGT CATAGCGGCA CGGACGTGCG AGAGCTTGGG
CAGGTGCCCG ATGAGGCGCT CTACGCGACG GGCTACTTTC AGCGCCCAAC GCTGGTTGTG
GACCCGGATC CCAGCCTGAA GATCGTCGCC GAGGAGCAAT TCGGCCCCGC CCTGCCGATC
CTACCCTTCG ACACGGAAGA CGAGGCGATT GCGGCGGCGA ATGACAGCCG CTTTGGCCTC
TGCTCATCGG TCTGGACGGA GGATCGCGAC CGCGCTGTTG CCCTCTCTCG CCGGATCGAG
GCGGGCTATA CCTATCTGAA CGCCCATGGT CCCGCGGCGC AGGACGGACG CGGACCGTTC
GGCGGGTTCA AGGACAGCGG GATCGGCAGA AATCTTGGGT ACGAGGGCGT GATCCAGTTT
CAGGGTCACC ACACGATCAG CGGGCCGAGC GGATGGCTTA TCAGTTGA
 
Protein sequence
MIDRSEKTRR GEDTVDNAME TDAEDVPVET HLFVDGEARP ASAGRLYPIY NPARPDELVG 
HAAAADADDV DAAVRAADAA FPAWSSRTYT ERAELLIAIA DALSSDDADV ARRSRLFCRE
HGKILRETHL ELSRLGDRFR LSASYAERLA ADETLQGPPF DTIITRQPRG VAALIVPWNW
PLSILGAKLP QALMAGNTVV VKPSHNSALA PSQTLRIIAE MLPPGVLSVV TGSASDIGDP
LVRHPLVRFV NFTGSVEVGR HVMRQAADNL TPVTLELGGN DAALICEDAA LDDGAFMRMY
MGAFMSSGQI CMALKRLYVH RSRFDEVVDG LEATCNRMVV GDGLLDGTNM GPVNNAKQLQ
VVTDMINEAR HSGTDVRELG QVPDEALYAT GYFQRPTLVV DPDPSLKIVA EEQFGPALPI
LPFDTEDEAI AAANDSRFGL CSSVWTEDRD RAVALSRRIE AGYTYLNAHG PAAQDGRGPF
GGFKDSGIGR NLGYEGVIQF QGHHTISGPS GWLIS
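
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before reuse. The sketch below strips the whitespace, computes a GC fraction, and translates a coding sequence up to its first stop codon. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two differ in which start codons are permitted). The helper names are hypothetical, and only the first line of the gene sequence is used as input here.

```python
import itertools

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, as displayed in the record.
first_line = "ATGATTGACC GGAGTGAGAA AACGCGGCGA GGGGAGGATA CCGTGGATAA CGCCATGGAG"
print(translate(first_line))  # MIDRSEKTRRGEDTVDNAME
```

The printed residues match the first two blocks of the protein sequence shown above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.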