Gene Jann_3937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_3937 
Symbol 
ID3936418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp4035087 
End bp4036637 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content64% 
IMG OID637906315 
Productbetaine-aldehyde dehydrogenase 
Protein accessionYP_511879 
Protein GI89056428 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.611638 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGA TGACGATTAT TCCCTCCGCC GGGGTCTCCA TCCCTGCGCC TTTTCGGGGG 
CGGCACCTGA TCGGTGGCGT GTGGTGCGAC AGTGCCGATG GTGCAGTGTC CGACAGGCAC
TCCCCGGCCC ATGGCACCCA CGTCAGCACG GCTGCCAGGG GCGGCGCGAC GGAGGCGGAC
GCTGCCATTG CGGCGGCTCG GACCACCTTC GATGCGGGCG ATTGGCCGTT CTCCAGCGGG
GCATCGCGCG CGGCGATCCT GCTCAAAGTC GCGGACCTGA TTGAGCGGGA TCTGGACCGG
ATCGCCCTTC TGGAAACGCT CGAATCCGGC AAGCCGATCA GCCAGGCAAA AGCCGAGATC
GGCGGCGCGG CGGACCTGTG GCGCTACGCT GCCAGCCTCG CGCGGATGAT CCATGGCGAT
AGCCACAATT CCCTTGGCGC GGACATGTTG GGTGTCGTCC TGAAAGAGCC CATCGGCGTC
GTGTCCATGA TCACGCCCTG GAACTTTCCG TTCCTGATCG TGTCCCAAAA GTTGCCCTTC
GCGCTGGCGG CAGGCTGCAC GGCGGTGATC AAACCGTCGG AACTGACGCC GTCCACGACC
TGCATTCTGG GTGAATTACT GTTCGAGGCA GGGCTGCCCG CAGGGGTCGC CAACATCGTG
CTGGGGTTTG GCGACCCGGT GGGCGAGGTT CTGTCGACGG ATCCACGCGT GGATATGGTC
AGCTTCACCG GCTCCACCGG CGTCGGCAAA CAGATTTCCG CAGCCGCCAG CGGCACGTTG
AAGAAGGTCT CGCTGGAGTT GGGCGGCAAG AACCCGCAGG TGATCTTCCC CGACGCCGAT
TTGGATCAGG CCGCCGATGC GATCACCTTC GGCGTCTATT TCAACGCGGG CGAATGCTGC
AACTCCGGCT CCCGTATCAT CGTGCATGAA GATGTGGCGG AGGAGCTGAC CGCAAAGGTC
GTCGCCCTGT CGCGCCGCGT GCCGTTCGGC GACCCGCTGG ACCCGGCCAC CCAAGTCGGC
GCGATCATTT CGCCCGAGCA TATGGCGAAG ATCGACGGCT ATGTGCAGGA CGCCGTGAAG
GATGGCGCGC GGCTTGCCAT CGGTGGCGCG GCGCTGGACG TAGACGGTGT GGGGCCGCAA
TTCTACCAGC CCACGGTGGT CACCGATCTG CGCGAAGACA TGGCCATCGC GCGTGATGAG
GTCTTTGGTC CGGTGCTGGC TGTGCTGACG TTTCGGACCC TCGATGACGC CTTAAGTCTT
TGCAACAACG CAACTTATGG CCTGTCTGCG GGGGTTTGGT CCAAGGACAT GTCCACCTGC
CTGTCATTCG CGCGCCGGGT GCAGGCGGGG ACCGTGTGGA CAAACACATG GATGGACGGC
TTCCCGGAAA TGCCTTTTGG CGGGGTCAAG GAAAGCGGGC AGGGACGCGA ATTGGGGCGC
TATGGTCTTG AGGAATTCCT GGAGGTCAAA ACCGTCCAGA TGCGCATCGG CGACAGCCGT
CAGATGTGGG TCACGCCGGA GGGCGTGCAA TCAGCGGATC TCTCTGAATG A
 
Protein sequence
MTEMTIIPSA GVSIPAPFRG RHLIGGVWCD SADGAVSDRH SPAHGTHVST AARGGATEAD 
AAIAAARTTF DAGDWPFSSG ASRAAILLKV ADLIERDLDR IALLETLESG KPISQAKAEI
GGAADLWRYA ASLARMIHGD SHNSLGADML GVVLKEPIGV VSMITPWNFP FLIVSQKLPF
ALAAGCTAVI KPSELTPSTT CILGELLFEA GLPAGVANIV LGFGDPVGEV LSTDPRVDMV
SFTGSTGVGK QISAAASGTL KKVSLELGGK NPQVIFPDAD LDQAADAITF GVYFNAGECC
NSGSRIIVHE DVAEELTAKV VALSRRVPFG DPLDPATQVG AIISPEHMAK IDGYVQDAVK
DGARLAIGGA ALDVDGVGPQ FYQPTVVTDL REDMAIARDE VFGPVLAVLT FRTLDDALSL
CNNATYGLSA GVWSKDMSTC LSFARRVQAG TVWTNTWMDG FPEMPFGGVK ESGQGRELGR
YGLEEFLEVK TVQMRIGDSR QMWVTPEGVQ SADLSE