Gene M446_3581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3581 
Symbol 
ID6134182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3998050 
End bp3999264 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content73% 
IMG OID641643748 
Productcysteine desulfurase NifS 
Protein accessionYP_001770396 
Protein GI170741741 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.157563 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0285875 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCGA CCCACACCAC GGCCTATCTC GACAACAACG CCACCACCCG GGTCGATTCC 
CGGGTCGTGG AGGCCATGCT CCCGTTCCTC ACCGAGCATT TCGGCAACGC CTCCTCCATG
CACGCCTTCG GGGCGGCGGT GGGCGGGGCC GTGCGGGCGG CGCGGGGCGA GGTCCAGGCG
CTGCTCGGCG CCGCCCACGA TTCCGAGATC GTCTTCACCT CGGGCGGCAC CGAGAGCGAC
AACACCGCGA TCCTCTCCGC CCTGGAGGTG AGCCCGCGGC GGCGGGAGAT CGTCACCAGC
GCGGTCGAGC ACCCGGCCGT GCTCTCCCTG TGCAGCCACC TGGAGAAGAA CCGGGGCATC
AAGGTCCACG TCATCCCGGT CGACGGGAAG GGGCGGCTCG ACCGGGCGGC CTACAGCGCG
GCCCTGTCGG AGCGGGTCGC CGTGGTGTCG ATCATGTGGG CCAACAACGA GACCGGGACG
ATCTTCCCGG TCGCGGACCT CGCCGAGGAG GCGAAGGCGC ACGGGGCGAT GTTCCACACC
GACGCGGTGC AGGCGGTGGG CAAGGTGCCG ATCGACCTCA AGGCGACGGC GATCGACATG
CTGTCGCTCT CGGCCCACAA GCTGCACGCG CCGAAGGGCG TCGGGGCGCT CTACCTGCGG
CGCGGCCTGC GCGTCCGGCC GCTCCTGCGC GGCGGCCACC AGGAGCGGGG CCGGCGCGCC
GGCACCGAGA ACATCCCGGG CATCGTCGCC CTCGGCGCGG CGGCGCGGAT CGCCGCGGAG
GGGCTCGCCG CGGACGCGAT CCGGGTCGGC GCCCTGCGCG ACCGGCTGGA GAAGGGCCTG
CTGCAGCGCA TCCCGCACTG CTTCGTCACC GGCGACCCGG ATCACCGCCT GCCCAACACC
GCCAACGTCG CCTTCGCGTA TATCGAGGGC GAGGGCATCC TGCTCCTGCT CAACCGGGCG
GGGATCGCCG CCTCCTCGGG CTCGGCCTGC ACCTCGGGCT CGCTCGAACC CTCCCACGTG
CTGCGCGCCA TGAAGGTGCC CGCCACGGCG GCGCACGGGG CGATCCGCTT CTCGCTCTCG
CGCGAGACGA CGGGCGAGGA GGTCGACCGG GTGCTCGAGG CCATGCCGGG CATCGTCGGC
AAGCTGCGCG ACCTGTCCCC GTTCTGGAGC GGGACGGGAA GCGAGGCCGC GTCCTTCAAT
CCCGTCTACG CCTGA
 
Protein sequence
MTATHTTAYL DNNATTRVDS RVVEAMLPFL TEHFGNASSM HAFGAAVGGA VRAARGEVQA 
LLGAAHDSEI VFTSGGTESD NTAILSALEV SPRRREIVTS AVEHPAVLSL CSHLEKNRGI
KVHVIPVDGK GRLDRAAYSA ALSERVAVVS IMWANNETGT IFPVADLAEE AKAHGAMFHT
DAVQAVGKVP IDLKATAIDM LSLSAHKLHA PKGVGALYLR RGLRVRPLLR GGHQERGRRA
GTENIPGIVA LGAAARIAAE GLAADAIRVG ALRDRLEKGL LQRIPHCFVT GDPDHRLPNT
ANVAFAYIEG EGILLLLNRA GIAASSGSAC TSGSLEPSHV LRAMKVPATA AHGAIRFSLS
RETTGEEVDR VLEAMPGIVG KLRDLSPFWS GTGSEAASFN PVYA