Gene Gdia_3312 details

Gene Information

Locus tag: Gdia_3312
Symbol:
ID: 6976752
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 3622255
End bp: 3623502
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 68%
IMG OID: 643392823
Product: cysteine desulfurase, SufS subfamily
Protein accession: YP_002277654
Protein GI: 209545425
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
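
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = feature_length(3622255, 3623502)
print(gene_len)                           # 1248
print(expected_protein_length(gene_len))  # 415
```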


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.0757647
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGTCT CGCCCGCCGT CATGCCCGAC CCGATCGACG CCCTGCGCGC CCGCCGCGAC 
GATTTCCCGA TCCTGAACGA GAGGGTGCAC GGCAAGCCGC TGGTCTTCCT GGACAGCGCG
GCCTCGGCCC AGAAGCCGCT TCCGGTCATC GAGGCGATGG CGGAGACGAT GCGTACCCAG
TACGCCAACA TCCATCGCGG CCTGCACTGG ATGAGCGAGC GGACCACCGA CGCGTACGAG
GGCGTGCGCG ACCAGGTCGC CGGCCTGATC GGTGCGGCGC GCGAGGAAAT CATCTTCACG
CGCAACAGCA CCGAGGCGAT CAACCTGGTC GCCCATTCCT TCGGCAGCCT GATGCGTCCG
GGCCAGGCGG TCGTGATCTC GGAAATGGAG CATCACGCCA ACCTGGTGCC CTGGCAGATG
CTGCGCGACC GCGCGGGCAT CGAACTGCGC GTGGCGCCGA TCAGCGATTC CGGCGACCTG
GAACTGGATG CCCTGGCGCG GCTGCTGGAT GACGGCAAGG TGGCGCTGGT GGCCGTCACC
CACATGTCCA ACGTGCTGGG CACCATCACC CCGGCGCGCA AGATCGCGGA CATCGCGCAT
GCCGCCGGTG CGCGGGTGCT GTTCGACGGC AGCCAGATGG TGGTCCATCA CCGGGTGGAC
GTGCGGGCGA TCGACGCCGA TTTCTACACC TTCACCGGGC ACAAGCTGTA CGGCCCCACG
GGCATCGGCG TGCTGTGGGG GCGGCGCGAA CTGCTGGAGG AAATGCCGCC CTTCCTGGGC
GGGGGCGACA TGATTTCCTC CGTCCGGTTC GAGGGATCGA GCTGGGCGAC CGTGCCCCAC
AAGTTCGAGG CCGGCACGCC CGCCATTATC GAGACCATCG GGCTGGGGGC CGCCATCTCC
TACGTCGAAT CGGTGGGATA TGACGCCATC GCGGCGCATG AATCCGCGCT GCTGGACCAT
GCGCTGCGGC GGCTGGGCGA GGTGCCCGGC CTGCACGTGG TGGGGTCGCC GGTCGAACGC
GGCGGCGTGA TCTCGTTCAC CATGGACGAC GTGCATCCGC ATGACATCGC CACCCTGCTG
GACCGGAACG GCATCGCGAT CCGGGCCGGC CATCATTGCG CGGAACCGCT GATGCGGCGC
CTGGGGCTGT CCGCCACCGC GCGGGCCAGC TTCGGCCTCT ATACGACGCG CGAGGAGGTG
GACGCTCTGG CCAGGACGCT GGAGCAGATC CGCTCCTTCT TCCTGTAA
 
Protein sequence
MDVSPAVMPD PIDALRARRD DFPILNERVH GKPLVFLDSA ASAQKPLPVI EAMAETMRTQ 
YANIHRGLHW MSERTTDAYE GVRDQVAGLI GAAREEIIFT RNSTEAINLV AHSFGSLMRP
GQAVVISEME HHANLVPWQM LRDRAGIELR VAPISDSGDL ELDALARLLD DGKVALVAVT
HMSNVLGTIT PARKIADIAH AAGARVLFDG SQMVVHHRVD VRAIDADFYT FTGHKLYGPT
GIGVLWGRRE LLEEMPPFLG GGDMISSVRF EGSSWATVPH KFEAGTPAII ETIGLGAAIS
YVESVGYDAI AAHESALLDH ALRRLGEVPG LHVVGSPVER GGVISFTMDD VHPHDIATLL
DRNGIAIRAG HHCAEPLMRR LGLSATARAS FGLYTTREEV DALARTLEQI RSFFL
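
The protein sequence can be recovered from the gene sequence using the genetic code. The following is a minimal sketch rather than the database's own method: it builds the codon table for NCBI translation table 11 (whose amino-acid assignments match the standard code), strips the 10-base spacing used above, and translates until the first stop codon. The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and alternative start codons, which table 11 permits, are not handled because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order (NCBI table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the whitespace used for display blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at (and excluding) the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown above.
first_line = "ATGGATGTCT CGCCCGCCGT CATGCCCGAC CCGATCGACG CCCTGCGCGC CCGCCGCGAC"
print(translate(first_line))  # MDVSPAVMPDPIDALRARRD
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the protein sequence and GC content listed in this record.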