Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2647 |
Symbol | |
ID | 8137989 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3084448 |
End bp | 3085617 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644870251 |
Product | cysteine desulfurase NifS |
Protein accession | YP_003022441 |
Protein GI | 253701252 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | [TIGR03402] cysteine desulfurase NifS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 0.00311085 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAGAGA TCTATCTTGA CAACAACGCC ACCACCATGG TGGACGAGCG GGTTTTCGAG GAGATGCGTC CCTATTTCTG CGAGCTGTAC GGCAACCCGA GCTCCATGCA CTTCTTCGGG GGGCAGGTGC AAAAGAAGGT GGACGAGGCG CGCAGCCGCG TCGCCTCGCT TCTGGGCGCG CTCCCCGACG AGATCGTCTT CACCGCCTGC GGGACCGAGA GCGACAACGC CGCCATTCGT TCCGCGCTCG AGGTCTTTCC CGAAAAGCGC CACATCATCA CCAGCCGTGT CGAGCACCCC GCGGTGCTTA CCCAGTGCCG CAACCTCACC AAGCGCGGGT ACCGGGTCAC CGAGCTGAAC GTGGACGGTA ACGGGCAACT CGACCTCAAG GAACTCGAAG CGGCGCTGGA TGACGATACC GTCATTGTCT CCCTCATGTA CGCCAACAAC GAAACCGGCG TCATCTTCCC TATCGAGGAA GCCGCCAGGA TGGTGAAGGC GAAGGGCGCG CTCTTCCACA CCGACGCCGT TCAGGCCGTG GGCAAGATCC CGCTCAACAT GGCCGAATCC GCCATCGACC TGCTTTCCCT TTCCGGGCAC AAGCTGCACG CCCCCAAAGG GGTAGGCGTA CTTTACGTGC GCCGCGGCAC GCCGTTTCGC CCGCTTCTGG TCGGCGGCCA CCAGGAGCGC GGGCGCAGGG CGGGGACCGA GAACACCGCG TCCATCATCG CCATGGGCAA GGCCTGCGAG CTTGCCCACC TGCACATGCC CGAGGAAGCG GGGCGCGTGC GCGAGATGCG CGACAGGCTG GAGCGCGAAC TGACCGCGCT CATCCCCAAC ACCAGGATCA ACGGCGGCGG CACCGACCGT CTCCCCAACA CCCTTTCCAT CGCCATGGAG TTCGTGGAAG GGGAGGGGAT ACTGCTGCTT CTCTCCGAGA AGGGAATCTG CGCCTCCTCC GGCAGCGCCT GCACCTCCGG CTCGTTGGAG CCGTCCCACG TACTGCGCGC CATGGGTGTT CCCTTTACCT GCGCCCACGG CTCCATCCGC TTCTCGCTCT CCAGGTTCAC CACCGACGCC GAGATCGACG CCGTCATCGA AGCTTTGCCG CCGATCATCA GCCGCCTGCG CCAGATGTCG CCGTTTGGCA GGGAGTTCCT GAACAAATAG
|
Protein sequence | MKEIYLDNNA TTMVDERVFE EMRPYFCELY GNPSSMHFFG GQVQKKVDEA RSRVASLLGA LPDEIVFTAC GTESDNAAIR SALEVFPEKR HIITSRVEHP AVLTQCRNLT KRGYRVTELN VDGNGQLDLK ELEAALDDDT VIVSLMYANN ETGVIFPIEE AARMVKAKGA LFHTDAVQAV GKIPLNMAES AIDLLSLSGH KLHAPKGVGV LYVRRGTPFR PLLVGGHQER GRRAGTENTA SIIAMGKACE LAHLHMPEEA GRVREMRDRL ERELTALIPN TRINGGGTDR LPNTLSIAME FVEGEGILLL LSEKGICASS GSACTSGSLE PSHVLRAMGV PFTCAHGSIR FSLSRFTTDA EIDAVIEALP PIISRLRQMS PFGREFLNK
|
| |