Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2275 |
Symbol | |
ID | 4073269 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2696966 |
End bp | 2698126 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637984291 |
Product | aminotransferase, class V |
Protein accession | YP_591350 |
Protein GI | 94969302 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | [TIGR03402] cysteine desulfurase NifS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTCAA TGCAGCGCGT TTACCTCGAC AACAACGCGA CCACGCCGCT CCTCCCCGAA GTGCTGGAGG CGATGCAGCC GTATTTCCTC GGACAATTCG GCAATGCGTC GTCCATCCAC CAGCAAGGCC AGCAAGCGCG GGCCGCGGTG GAGCACGCGC GCGAGCACGT CGCGGATCTT ATCGGTGCAC GTGAGGCAGA GATCGTTTTC ACCAGCGGCG GCACCGAGGG CGACAACCTC GCGCTCTTCG GCCTCTGCAA GCCGGGCGAC CACCTGATCA TCAGCACGAT CGAGCACCAT GCGGTGCTGA ACTCCGCGCA GCGCCTGAAA GAACTCGGCG TAGAAGTCAC AGACGTTCCG GTAGACGGGC AGGGAATTGT CGATCCTGAC GCGGTAAAGC GTGCGCTCCG CGCAAATACC AGGCTGATCA GCATCATGCT CGCGAATAAC GAAACCGGTG TTGTACAGAA CGCCGTAGAG ATTGGGAAAA TCGCCGCCGA AGCCGACGTC TATTTCCACA CCGACGCCGT ACAGGCCATC GCCAAGATTC CCGTCGACGT AAATGAGATC CGCTGCGACC TGCTGACCCT CGCCGGACAT AAGATCCACG CACCGCAAGG AACAGGCGCG CTCTACGTGC GCAAGGGTAC GATCCTCGAT CCACTCTTTT ACGGCGGCCG TCATGAGCGT TCGCGACGTG CGGGGACGGA GAACCTACCG GGGATTGTCG GACTTGGCAA AGCCGCGGAA CTTGCGATGG CGTGGTTTGA GAATGACGGT CCAACTCGCA TGGCAGCGCT TCGCGACCGC CTCGAACAAA CGGTCGTCAG CCAACTCGAT CAACTGACGG TGAACAGTGG CAGCGCCCCC CGAGTGCCGA ACACCACGAA CGTGTCTTTC GATGGAATTG AAGGCGAAGC TATGGTGATC GCGCTCGATC TGAAGGGCCT TTCTGTCTCC ACCGGCGCGG CGTGTTCATC GGGCGCAATC GAACCTTCGC ACGTACTGAC CGCCATGGGC CTTACTCCCG AGCAGGCGCG GGGAAGCATT CGCTTTAGCG TAGGCAAGCA AAATACCGAG GCTGACATTC AGTTCGCCCT GGAGCGGGTG CCGGAAGTGG TCGCGAAATT GCGCGAGCTG AGCCCGGTTT ACAGGAAATA G
|
Protein sequence | MNSMQRVYLD NNATTPLLPE VLEAMQPYFL GQFGNASSIH QQGQQARAAV EHAREHVADL IGAREAEIVF TSGGTEGDNL ALFGLCKPGD HLIISTIEHH AVLNSAQRLK ELGVEVTDVP VDGQGIVDPD AVKRALRANT RLISIMLANN ETGVVQNAVE IGKIAAEADV YFHTDAVQAI AKIPVDVNEI RCDLLTLAGH KIHAPQGTGA LYVRKGTILD PLFYGGRHER SRRAGTENLP GIVGLGKAAE LAMAWFENDG PTRMAALRDR LEQTVVSQLD QLTVNSGSAP RVPNTTNVSF DGIEGEAMVI ALDLKGLSVS TGAACSSGAI EPSHVLTAMG LTPEQARGSI RFSVGKQNTE ADIQFALERV PEVVAKLREL SPVYRK
|
| |