Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0484 |
Symbol | |
ID | 4068609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 598115 |
End bp | 599368 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637982488 |
Product | aminotransferase, class V |
Protein accession | YP_589563 |
Protein GI | 94967515 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | [TIGR02006] cysteine desulfurase IscS [TIGR03402] cysteine desulfurase NifS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0750137 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACGA ACGGAAATGG GAACAAGGCG GTGAAGCTGC CGATTTACAT GGACAACCAC GCGACCACGC CGATGGATCA GCGTGTACTC GACGAAATGC TTCCCTACTT CATTGAGAAG TTCGGGAATG CCGCGAGCCG TAATCACGAG TTCGGCTGGG TGGCTGAGCA AGCAGTGGAC CAGGCGCGTG AGCGGATCGC GAAGGTGATC GGCGCGACGT CGAAGGAGAT CATTTTCACG TCGGGTGCGA CGGAGAGCGA CAACCTGGCG ATCAAGGGTG TGGCGCAGAT GTATCGCGAG AAGGGCAACC ACATCATTAC GCAGGTAACG GAGCACAAGG CCGTACTCGA CACGTGCAAG CGCCTCGAGA AGGAAGGCTT CCGCGTTACT TATCTGCCGG TGCAGAAGGA CGGGCGGATC GATCTCGACG ACCTGAAGCG CGCGATGGAC GACAAGACGA TCCTGGTGAC GATCATGGCG GCGAACAACG AGATCGGCGT GTTGCAGCCG ATCCGCGAGA TTGGCGCGCT GTGTCACGAG AAGGGCGTGG TCTTCCATAC CGACGCGGTG CAGATCATCG GCAAGGTTCC GTTCAATGTG ATTCAAGACA ACGTTGACCT GGCGTCGATC AGCGGACACA AGCTGTATGG GCCGAAGGGT GTGGGCGCTC TGTATGTGCG TCGCAAGAAC CCGCGCGTGC AACTGGTGGC GCAGATCGAT GGCGGCGGTC ATGAGCGCGG CATGCGGTCG GGCACGCTGA ATGTTACGGG CATCGTTGGC CTTGGGAAGG CGATTGAGCT GGCTGGCCAG GAGATGGCAG AAGAAGGCAA GCGCATGACG GCGCTGCGCG ATCGGCTGAA GGACAAGATC TTCTCGGAGC TCGACGAAGT GTACGTCAAC GGCTCGTGGG AGCATCGGCT GCCGGGCAAT CTGAATATTA GTTTTGCCTT CGTTGAGGGC GAGTCGCTGC TGATGGGGAT CAATGACATT GCGGTTTCGA GCGGTTCGGC ATGCACCTCT GCGACGTTGG AGCCATCTTA TGTATTAAAG GCACTCGGCG CAGGCGACGA TCTTGCTCAC AGCTCCATCC GCTTCGGGTT GGGACGCTTC AACACGGACG CGGAAGTGGA TTATGTGGCA AACAAGCTGA TTGACGTGGT GAAGAGACTG CGCGAGTTGT CGCCGCTGTA CGAGATGCAC AAGGAAGGCA TCGATTTGAC GAAGGTGCAG TGGGCTGCCG AGGGCGGGCA CTAA
|
Protein sequence | MSTNGNGNKA VKLPIYMDNH ATTPMDQRVL DEMLPYFIEK FGNAASRNHE FGWVAEQAVD QARERIAKVI GATSKEIIFT SGATESDNLA IKGVAQMYRE KGNHIITQVT EHKAVLDTCK RLEKEGFRVT YLPVQKDGRI DLDDLKRAMD DKTILVTIMA ANNEIGVLQP IREIGALCHE KGVVFHTDAV QIIGKVPFNV IQDNVDLASI SGHKLYGPKG VGALYVRRKN PRVQLVAQID GGGHERGMRS GTLNVTGIVG LGKAIELAGQ EMAEEGKRMT ALRDRLKDKI FSELDEVYVN GSWEHRLPGN LNISFAFVEG ESLLMGINDI AVSSGSACTS ATLEPSYVLK ALGAGDDLAH SSIRFGLGRF NTDAEVDYVA NKLIDVVKRL RELSPLYEMH KEGIDLTKVQ WAAEGGH
|
| |