Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Jann_3503 |
Symbol | |
ID | 3935977 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Jannaschia sp. CCS1 |
Kingdom | Bacteria |
Replicon accession | NC_007802 |
Strand | - |
Start bp | 3558723 |
End bp | 3560270 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637905877 |
Product | 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
Protein accession | YP_511445 |
Protein GI | 89055994 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00116503 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCAAGC TGGACGAAAA CATCGCAAAA CTGGACGGCT ATGTGGCGCG GTTCCGCGAG GGTGGCATTC CAAACCGGAT CGGGGGCGTG GACGTGCCGG GGGCTGGCGG TGTGTTCCAG ACCATGTCTC CGGTCGATAA AAGCGTCATC TGTGATGTGG CCCACGGAAC GGAGGCGGAT ATCGACGCCG CCGCCAATGC GGCCCACGGG GCGTTTCCCG CTTGGCGTGA CATGCCCGCG ACGGAGCGGA AGCGCATCCT TGTTCGCGTG GCCGACGCCA TTGAAGCGCG CGCCGAGGAA ATCGCGCTCT GCGAATGCTG GGACACGGGC CAGGCTTTCA AATTCATGTC CAAGGCGGCC CTGCGCGGGG CGGAAAACTT CCGTTATTTT GCTGATCAGG TGGTTCAGGC CCGCGATGGT CAGCACCTGA AATCGCCCAC GTTGATGAAC GTGACCTCCC GCGTGCCCAT CGGCCCCGTG GGGGTCATCA CGCCCTGGAA CACGCCATTC ATGCTGTCGA CGTGGAAGAT CGCACCGGCG CTGGCGGCGG GCTGCACGGT GGTCCACAAA CCGGCGGAAG CTTCCCCGCT GACCGCGCGG CTGTTGGTGG AAATCGCGGA AGAGGCGGGC CTTCCGCCCG GCGTGCTGAA CACGGTCAAC GGCTTCGGGG AAGGGGCTGG AAAGGCGCTC TGCGAGCATC CGAAAATCCG GGCGATTGCC TTTGTGGGTG AATCCAAGAC AGGCTCCCTG ATCACTAAGC AAGGGGCGGA CACGCTCAAG CGCAACCATC TGGAATTGGG CGGCAAGAAC CCCGTCATCG TTTTTGAAGA CGCCGACCTG GAGCGCGCTT TGGATGCGGT GATCTTCATG ATCTACTCCA TCAATGGGGA GCGTTGCACG TCGTCCTCCC GGCTTCTTGT GCAAGATACG ATCCGAGAAG ATTTTGAGGC GAAGCTGGTG GCGCGCGTCA ACGCCATCAA AGTCGGCCAC CCCCTGGATC CGACGACGGA AGTGGGCCCG CTGATCAGCG AAGAGCATTT CGCCAAAGTT ACCAGCTACT TCGATATCGC GCGCCAGGAC GGTGCGACCA TTGCGGCAGG GGGCGAGGCC TTCGGTGACA GCGGCTACTT CGTCAAACCC ACGCTCTTCA CCAAGGCCAC CAACGACATG CGCATTGCGC AGGAAGAGAT CTTTGGTCCC GTCCTCACCT CCATCCCGTT TTCGTCCGAG GAGGAGGCGC TTCGGATCGC CAACGACACA CCCTACGGCC TCACCGGATA TGTCTGGACC AATGACCTGA CCCGCGCCCT GCGGTTCACG GATGCGCTGG AGGCGGGGAT GATTTGGGTG AATTCCGAGA ATGTCCGCCA CCTGCCGACC CCGTTCGGTG GGGTGAAATC GTCGGGGATC GGACGCGATG GCGGGGATTG GAGTTTCGAG TTCTACATGG AGCAAAAGCA TGTGGGCTTC GCCGTAGGGC AGCACAAAAT CACCAAGTTG GGTGCGCTCA AGCAGCAAAG CGATAGCCCA GAAAGGGGAG CCTCTTAG
|
Protein sequence | MSKLDENIAK LDGYVARFRE GGIPNRIGGV DVPGAGGVFQ TMSPVDKSVI CDVAHGTEAD IDAAANAAHG AFPAWRDMPA TERKRILVRV ADAIEARAEE IALCECWDTG QAFKFMSKAA LRGAENFRYF ADQVVQARDG QHLKSPTLMN VTSRVPIGPV GVITPWNTPF MLSTWKIAPA LAAGCTVVHK PAEASPLTAR LLVEIAEEAG LPPGVLNTVN GFGEGAGKAL CEHPKIRAIA FVGESKTGSL ITKQGADTLK RNHLELGGKN PVIVFEDADL ERALDAVIFM IYSINGERCT SSSRLLVQDT IREDFEAKLV ARVNAIKVGH PLDPTTEVGP LISEEHFAKV TSYFDIARQD GATIAAGGEA FGDSGYFVKP TLFTKATNDM RIAQEEIFGP VLTSIPFSSE EEALRIANDT PYGLTGYVWT NDLTRALRFT DALEAGMIWV NSENVRHLPT PFGGVKSSGI GRDGGDWSFE FYMEQKHVGF AVGQHKITKL GALKQQSDSP ERGAS
|
| |