Gene Jann_3503 details


Gene Information

Locus tag: Jann_3503
Symbol:
ID: 3935977
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 3558723
End bp: 3560270
Gene Length: 1548 bp
Protein Length: 515 aa
Translation table: 11
GC content: 62%
IMG OID: 637905877
Product: 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
Protein accession: YP_511445
Protein GI: 89055994
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
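
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record for Jann_3503.
length_bp = gene_length(3558723, 3560270)
print(length_bp)                           # 1548
print(expected_protein_length(length_bp))  # 515
```

Both results agree with the Gene Length (1548 bp) and Protein Length (515 aa) fields in the record.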


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00116503
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCAAGC TGGACGAAAA CATCGCAAAA CTGGACGGCT ATGTGGCGCG GTTCCGCGAG 
GGTGGCATTC CAAACCGGAT CGGGGGCGTG GACGTGCCGG GGGCTGGCGG TGTGTTCCAG
ACCATGTCTC CGGTCGATAA AAGCGTCATC TGTGATGTGG CCCACGGAAC GGAGGCGGAT
ATCGACGCCG CCGCCAATGC GGCCCACGGG GCGTTTCCCG CTTGGCGTGA CATGCCCGCG
ACGGAGCGGA AGCGCATCCT TGTTCGCGTG GCCGACGCCA TTGAAGCGCG CGCCGAGGAA
ATCGCGCTCT GCGAATGCTG GGACACGGGC CAGGCTTTCA AATTCATGTC CAAGGCGGCC
CTGCGCGGGG CGGAAAACTT CCGTTATTTT GCTGATCAGG TGGTTCAGGC CCGCGATGGT
CAGCACCTGA AATCGCCCAC GTTGATGAAC GTGACCTCCC GCGTGCCCAT CGGCCCCGTG
GGGGTCATCA CGCCCTGGAA CACGCCATTC ATGCTGTCGA CGTGGAAGAT CGCACCGGCG
CTGGCGGCGG GCTGCACGGT GGTCCACAAA CCGGCGGAAG CTTCCCCGCT GACCGCGCGG
CTGTTGGTGG AAATCGCGGA AGAGGCGGGC CTTCCGCCCG GCGTGCTGAA CACGGTCAAC
GGCTTCGGGG AAGGGGCTGG AAAGGCGCTC TGCGAGCATC CGAAAATCCG GGCGATTGCC
TTTGTGGGTG AATCCAAGAC AGGCTCCCTG ATCACTAAGC AAGGGGCGGA CACGCTCAAG
CGCAACCATC TGGAATTGGG CGGCAAGAAC CCCGTCATCG TTTTTGAAGA CGCCGACCTG
GAGCGCGCTT TGGATGCGGT GATCTTCATG ATCTACTCCA TCAATGGGGA GCGTTGCACG
TCGTCCTCCC GGCTTCTTGT GCAAGATACG ATCCGAGAAG ATTTTGAGGC GAAGCTGGTG
GCGCGCGTCA ACGCCATCAA AGTCGGCCAC CCCCTGGATC CGACGACGGA AGTGGGCCCG
CTGATCAGCG AAGAGCATTT CGCCAAAGTT ACCAGCTACT TCGATATCGC GCGCCAGGAC
GGTGCGACCA TTGCGGCAGG GGGCGAGGCC TTCGGTGACA GCGGCTACTT CGTCAAACCC
ACGCTCTTCA CCAAGGCCAC CAACGACATG CGCATTGCGC AGGAAGAGAT CTTTGGTCCC
GTCCTCACCT CCATCCCGTT TTCGTCCGAG GAGGAGGCGC TTCGGATCGC CAACGACACA
CCCTACGGCC TCACCGGATA TGTCTGGACC AATGACCTGA CCCGCGCCCT GCGGTTCACG
GATGCGCTGG AGGCGGGGAT GATTTGGGTG AATTCCGAGA ATGTCCGCCA CCTGCCGACC
CCGTTCGGTG GGGTGAAATC GTCGGGGATC GGACGCGATG GCGGGGATTG GAGTTTCGAG
TTCTACATGG AGCAAAAGCA TGTGGGCTTC GCCGTAGGGC AGCACAAAAT CACCAAGTTG
GGTGCGCTCA AGCAGCAAAG CGATAGCCCA GAAAGGGGAG CCTCTTAG
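
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be cleaned before any calculation. The following is a minimal sketch of how a GC percentage, such as the GC content field of this record, could be computed from such text. The helper names are hypothetical, and the database's exact rounding method is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a block-formatted nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, used here only as sample input.
first_line = "ATGAGCAAGC TGGACGAAAA CATCGCAAAA CTGGACGGCT ATGTGGCGCG GTTCCGCGAG"
print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence text to gc_percent gives a value that can be compared with the GC content field.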
 
Protein sequence
MSKLDENIAK LDGYVARFRE GGIPNRIGGV DVPGAGGVFQ TMSPVDKSVI CDVAHGTEAD 
IDAAANAAHG AFPAWRDMPA TERKRILVRV ADAIEARAEE IALCECWDTG QAFKFMSKAA
LRGAENFRYF ADQVVQARDG QHLKSPTLMN VTSRVPIGPV GVITPWNTPF MLSTWKIAPA
LAAGCTVVHK PAEASPLTAR LLVEIAEEAG LPPGVLNTVN GFGEGAGKAL CEHPKIRAIA
FVGESKTGSL ITKQGADTLK RNHLELGGKN PVIVFEDADL ERALDAVIFM IYSINGERCT
SSSRLLVQDT IREDFEAKLV ARVNAIKVGH PLDPTTEVGP LISEEHFAKV TSYFDIARQD
GATIAAGGEA FGDSGYFVKP TLFTKATNDM RIAQEEIFGP VLTSIPFSSE EEALRIANDT
PYGLTGYVWT NDLTRALRFT DALEAGMIWV NSENVRHLPT PFGGVKSSGI GRDGGDWSFE
FYMEQKHVGF AVGQHKITKL GALKQQSDSP ERGAS