Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0909 |
Symbol | |
ID | 3784956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1033305 |
End bp | 1034543 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637810991 |
Product | glutamate--cysteine ligase, GCS2 |
Protein accession | YP_411604 |
Protein GI | 82702038 |
COG category | [S] Function unknown |
COG ID | [COG2170] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.237663 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGCGG CATTGCCGGC CTTTACAGGT TATGGCATCG AACTGGAGTA CATGATTGTC GACCGGAATA CGCTCGCGAT TGTTCCTATT GCCGACGAGT TGCTGCGAAA GCTTACCGGC GCGTTTGCAT CCGAGGCCGG GAACGGCCAG TTGGGATGGT CGAACGAGCT GGTGCTGCAT CTGCTCGAAC TGAAAAACAT CGACCCGCAA CCGGACATTT CCTCCCTGTC TGTCGCGTTT CAAGGCGAAA TCCGGCGGAT CAGCAAGGTG CTCGAATCGA TGAATGCACG GCTCATGCCG ACCGGCATGC ACCCCTGGAT GAATCCCCGC ACGGAGACCC GCCTTTGGCC GCACGAGCAG GCGGAAATCT ATCTGGCCTA TGACCATATC TTCGATTGCA GAAGACATGG CTGGGCCAAC CTGCAAAGCA TGCACCTGAA CATGCCTTTC GCAAACGACG ATGAATTTGC GCGGCTTCAC GCCGCGGTGC GGCTGCTGCT GCCCATTCTT CCAGCAGTAG CGGCAAGCTC CCCGATCGCC GAAGGCAGCT ATTCTGGTTG CCTGGATTTC CGGATGGCAA ATTACTGCGA GCACCAGCTG AAGGTACCCT CGACCATCGG CCGGGTAATC CCGGAAACAG TCTCCAGTGC CGCCGCCTAT GAAGAAAAGA TTCTGGCGCC GATGTACCGC GAAATTGCAG CACTCGATCC GGAAGAAATG CTACAGCATG AGTGGCTCAA TGCGCGCGGG GAAATTCCCC GCTTTGACCG TAACGCTATC GAGATCCGCG TAATCGATAC CCAGGAATGC TCGAGTGCTG ACTTTGCCAT CGCCGCTGCT GCGATCAATA TCGTGCATGC GCTCTATAGC GAAGCTTACG CGTCGCTGGT CGCGCAGCAA TCCATCGCCA CGGAAGCATT GGTCAGGATC ATGCATGCCT GCATCCGTGA CGCAGAGCAA GCCGTAATCG ATAATGTGGC ATACCTTCGC CTGATGGGTT TTCCGGGTAC GGATTGCACG GCTGGCGTTC TGTGGCGACA TCTTGTCGAG TCTGAGATGC CAGTCGAATC CAGGCAAGGA AAAAGCTGCC GAGAAGCGCT GCAAACCATT TTAAGAGAGG GCTCGCTTGC GCGCCGCATC CTGAATGCGA TCGGTTCCGA CTTCAGCAGG AGACGGTTGG AGACGGTCTA CCGCGAGCTG TGCGATTGTC TGGACGAAAA TCGCATGTTT CTGAAATGA
|
Protein sequence | MTAALPAFTG YGIELEYMIV DRNTLAIVPI ADELLRKLTG AFASEAGNGQ LGWSNELVLH LLELKNIDPQ PDISSLSVAF QGEIRRISKV LESMNARLMP TGMHPWMNPR TETRLWPHEQ AEIYLAYDHI FDCRRHGWAN LQSMHLNMPF ANDDEFARLH AAVRLLLPIL PAVAASSPIA EGSYSGCLDF RMANYCEHQL KVPSTIGRVI PETVSSAAAY EEKILAPMYR EIAALDPEEM LQHEWLNARG EIPRFDRNAI EIRVIDTQEC SSADFAIAAA AINIVHALYS EAYASLVAQQ SIATEALVRI MHACIRDAEQ AVIDNVAYLR LMGFPGTDCT AGVLWRHLVE SEMPVESRQG KSCREALQTI LREGSLARRI LNAIGSDFSR RRLETVYREL CDCLDENRMF LK
|
| |