Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_3312 |
Symbol | |
ID | 6976752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 3622255 |
End bp | 3623502 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643392823 |
Product | cysteine desulfurase, SufS subfamily |
Protein accession | YP_002277654 |
Protein GI | 209545425 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.0757647 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGTCT CGCCCGCCGT CATGCCCGAC CCGATCGACG CCCTGCGCGC CCGCCGCGAC GATTTCCCGA TCCTGAACGA GAGGGTGCAC GGCAAGCCGC TGGTCTTCCT GGACAGCGCG GCCTCGGCCC AGAAGCCGCT TCCGGTCATC GAGGCGATGG CGGAGACGAT GCGTACCCAG TACGCCAACA TCCATCGCGG CCTGCACTGG ATGAGCGAGC GGACCACCGA CGCGTACGAG GGCGTGCGCG ACCAGGTCGC CGGCCTGATC GGTGCGGCGC GCGAGGAAAT CATCTTCACG CGCAACAGCA CCGAGGCGAT CAACCTGGTC GCCCATTCCT TCGGCAGCCT GATGCGTCCG GGCCAGGCGG TCGTGATCTC GGAAATGGAG CATCACGCCA ACCTGGTGCC CTGGCAGATG CTGCGCGACC GCGCGGGCAT CGAACTGCGC GTGGCGCCGA TCAGCGATTC CGGCGACCTG GAACTGGATG CCCTGGCGCG GCTGCTGGAT GACGGCAAGG TGGCGCTGGT GGCCGTCACC CACATGTCCA ACGTGCTGGG CACCATCACC CCGGCGCGCA AGATCGCGGA CATCGCGCAT GCCGCCGGTG CGCGGGTGCT GTTCGACGGC AGCCAGATGG TGGTCCATCA CCGGGTGGAC GTGCGGGCGA TCGACGCCGA TTTCTACACC TTCACCGGGC ACAAGCTGTA CGGCCCCACG GGCATCGGCG TGCTGTGGGG GCGGCGCGAA CTGCTGGAGG AAATGCCGCC CTTCCTGGGC GGGGGCGACA TGATTTCCTC CGTCCGGTTC GAGGGATCGA GCTGGGCGAC CGTGCCCCAC AAGTTCGAGG CCGGCACGCC CGCCATTATC GAGACCATCG GGCTGGGGGC CGCCATCTCC TACGTCGAAT CGGTGGGATA TGACGCCATC GCGGCGCATG AATCCGCGCT GCTGGACCAT GCGCTGCGGC GGCTGGGCGA GGTGCCCGGC CTGCACGTGG TGGGGTCGCC GGTCGAACGC GGCGGCGTGA TCTCGTTCAC CATGGACGAC GTGCATCCGC ATGACATCGC CACCCTGCTG GACCGGAACG GCATCGCGAT CCGGGCCGGC CATCATTGCG CGGAACCGCT GATGCGGCGC CTGGGGCTGT CCGCCACCGC GCGGGCCAGC TTCGGCCTCT ATACGACGCG CGAGGAGGTG GACGCTCTGG CCAGGACGCT GGAGCAGATC CGCTCCTTCT TCCTGTAA
|
Protein sequence | MDVSPAVMPD PIDALRARRD DFPILNERVH GKPLVFLDSA ASAQKPLPVI EAMAETMRTQ YANIHRGLHW MSERTTDAYE GVRDQVAGLI GAAREEIIFT RNSTEAINLV AHSFGSLMRP GQAVVISEME HHANLVPWQM LRDRAGIELR VAPISDSGDL ELDALARLLD DGKVALVAVT HMSNVLGTIT PARKIADIAH AAGARVLFDG SQMVVHHRVD VRAIDADFYT FTGHKLYGPT GIGVLWGRRE LLEEMPPFLG GGDMISSVRF EGSSWATVPH KFEAGTPAII ETIGLGAAIS YVESVGYDAI AAHESALLDH ALRRLGEVPG LHVVGSPVER GGVISFTMDD VHPHDIATLL DRNGIAIRAG HHCAEPLMRR LGLSATARAS FGLYTTREEV DALARTLEQI RSFFL
|
| |