Gene GM21_2647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2647 
Symbol 
ID8137989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3084448 
End bp3085617 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content64% 
IMG OID644870251 
Productcysteine desulfurase NifS 
Protein accessionYP_003022441 
Protein GI253701252 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.00311085 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAGAGA TCTATCTTGA CAACAACGCC ACCACCATGG TGGACGAGCG GGTTTTCGAG 
GAGATGCGTC CCTATTTCTG CGAGCTGTAC GGCAACCCGA GCTCCATGCA CTTCTTCGGG
GGGCAGGTGC AAAAGAAGGT GGACGAGGCG CGCAGCCGCG TCGCCTCGCT TCTGGGCGCG
CTCCCCGACG AGATCGTCTT CACCGCCTGC GGGACCGAGA GCGACAACGC CGCCATTCGT
TCCGCGCTCG AGGTCTTTCC CGAAAAGCGC CACATCATCA CCAGCCGTGT CGAGCACCCC
GCGGTGCTTA CCCAGTGCCG CAACCTCACC AAGCGCGGGT ACCGGGTCAC CGAGCTGAAC
GTGGACGGTA ACGGGCAACT CGACCTCAAG GAACTCGAAG CGGCGCTGGA TGACGATACC
GTCATTGTCT CCCTCATGTA CGCCAACAAC GAAACCGGCG TCATCTTCCC TATCGAGGAA
GCCGCCAGGA TGGTGAAGGC GAAGGGCGCG CTCTTCCACA CCGACGCCGT TCAGGCCGTG
GGCAAGATCC CGCTCAACAT GGCCGAATCC GCCATCGACC TGCTTTCCCT TTCCGGGCAC
AAGCTGCACG CCCCCAAAGG GGTAGGCGTA CTTTACGTGC GCCGCGGCAC GCCGTTTCGC
CCGCTTCTGG TCGGCGGCCA CCAGGAGCGC GGGCGCAGGG CGGGGACCGA GAACACCGCG
TCCATCATCG CCATGGGCAA GGCCTGCGAG CTTGCCCACC TGCACATGCC CGAGGAAGCG
GGGCGCGTGC GCGAGATGCG CGACAGGCTG GAGCGCGAAC TGACCGCGCT CATCCCCAAC
ACCAGGATCA ACGGCGGCGG CACCGACCGT CTCCCCAACA CCCTTTCCAT CGCCATGGAG
TTCGTGGAAG GGGAGGGGAT ACTGCTGCTT CTCTCCGAGA AGGGAATCTG CGCCTCCTCC
GGCAGCGCCT GCACCTCCGG CTCGTTGGAG CCGTCCCACG TACTGCGCGC CATGGGTGTT
CCCTTTACCT GCGCCCACGG CTCCATCCGC TTCTCGCTCT CCAGGTTCAC CACCGACGCC
GAGATCGACG CCGTCATCGA AGCTTTGCCG CCGATCATCA GCCGCCTGCG CCAGATGTCG
CCGTTTGGCA GGGAGTTCCT GAACAAATAG
 
Protein sequence
MKEIYLDNNA TTMVDERVFE EMRPYFCELY GNPSSMHFFG GQVQKKVDEA RSRVASLLGA 
LPDEIVFTAC GTESDNAAIR SALEVFPEKR HIITSRVEHP AVLTQCRNLT KRGYRVTELN
VDGNGQLDLK ELEAALDDDT VIVSLMYANN ETGVIFPIEE AARMVKAKGA LFHTDAVQAV
GKIPLNMAES AIDLLSLSGH KLHAPKGVGV LYVRRGTPFR PLLVGGHQER GRRAGTENTA
SIIAMGKACE LAHLHMPEEA GRVREMRDRL ERELTALIPN TRINGGGTDR LPNTLSIAME
FVEGEGILLL LSEKGICASS GSACTSGSLE PSHVLRAMGV PFTCAHGSIR FSLSRFTTDA
EIDAVIEALP PIISRLRQMS PFGREFLNK