Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_3877 |
Symbol | |
ID | 8449496 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 4270143 |
End bp | 4271159 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645042925 |
Product | Cysteine synthase |
Protein accession | YP_003203161 |
Protein GI | 258654005 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.00690211 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0462514 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGTCC GGAGACCGCT CAAGGGCATT CTCTCCGCGA TCGGGGACAC CCCGCTGGTG GAGATGGGAT GCCTCGTTCC GGGTTTCGAC TTCCGGCTGT TCGCCAAGAT GGAGCGGTTC AACCCGGGTG GCTCGGTGAA GGATCGATCC GCGCTGGCCA TGCTGCAGGC CAAGATTCTC GACGGCGGGG TCCGGCCCGG CCGGACCGTG GTGATCGAAT CCAGCTCCGG CAACCTGGCC ATCGGTCTGG CCCAGATCTG CTGCTACTAC GGCATCGATC TGATCTGCGT CGTCGATGCC AAGACGACCA CGCAGAACCT GGCCATCCTG CGGGCCTACG GCGCGCGGGT CGAGGTGGTG ACCGACCGGG ATCCGGCCAC CGGCGAGTAC CTGCCCGAAC GGGTCCGTCG GGTCCGCCAC CTGCTCGACA CCCTGCCCCA CGCCTACTCG CCCAACCAGT ACGCGAACCT GCGCAATCCG GCCGCCCACG AGAACACCAT GCGCGAGATC GCCGAGGCGC TGGACGGCCG GGTGGACTTC CTGTTCGCCG CGGCGGGTAC CTTCGGCACG CTGCGCGGCT GCGTCGGATA CCTGCGGGCG CAAGGCCTGC CGACCCGGGT GATCGCGGTG GATGCGGTGG GCAGTGTCCT GTTTCACACC ACCCCCGGAC GCCGGCTGAT TCCCGGGCAT GGCGCGGCGA TCCGGCCGGC GTTGCTGGAC CCGAGCCTGG TCGACGACGT CGTGCACGTC ACCGACCTGG AGTGCATCGT GGCCTGCCGG GCCCTGACCC GGCAGGAGGG CATCCTGGCC GGTGGCTCGT CCGGGGCGAC GATCGCCGCG GTCCGCCGGT TCGCCCCGCG CATCCCGGCC GGCTCGACGG TGGTGGCGAT CTTTCCCGAC GGTGGCGACC GCTACCTGGA CACCATCTAT TCCGACCCCT GGGTGGCCGA GCACTTCGGC GCCGACGCGC TGCAACTCTG GCAGCGCCCG ATCATCGAGG ACGTGGCCTG GTGCTGA
|
Protein sequence | MAVRRPLKGI LSAIGDTPLV EMGCLVPGFD FRLFAKMERF NPGGSVKDRS ALAMLQAKIL DGGVRPGRTV VIESSSGNLA IGLAQICCYY GIDLICVVDA KTTTQNLAIL RAYGARVEVV TDRDPATGEY LPERVRRVRH LLDTLPHAYS PNQYANLRNP AAHENTMREI AEALDGRVDF LFAAAGTFGT LRGCVGYLRA QGLPTRVIAV DAVGSVLFHT TPGRRLIPGH GAAIRPALLD PSLVDDVVHV TDLECIVACR ALTRQEGILA GGSSGATIAA VRRFAPRIPA GSTVVAIFPD GGDRYLDTIY SDPWVAEHFG ADALQLWQRP IIEDVAWC
|
| |