Gene Nmul_A0909 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0909 
Symbol 
ID3784956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1033305 
End bp1034543 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content56% 
IMG OID637810991 
Productglutamate--cysteine ligase, GCS2 
Protein accessionYP_411604 
Protein GI82702038 
COG category[S] Function unknown 
COG ID[COG2170] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.237663 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGCGG CATTGCCGGC CTTTACAGGT TATGGCATCG AACTGGAGTA CATGATTGTC 
GACCGGAATA CGCTCGCGAT TGTTCCTATT GCCGACGAGT TGCTGCGAAA GCTTACCGGC
GCGTTTGCAT CCGAGGCCGG GAACGGCCAG TTGGGATGGT CGAACGAGCT GGTGCTGCAT
CTGCTCGAAC TGAAAAACAT CGACCCGCAA CCGGACATTT CCTCCCTGTC TGTCGCGTTT
CAAGGCGAAA TCCGGCGGAT CAGCAAGGTG CTCGAATCGA TGAATGCACG GCTCATGCCG
ACCGGCATGC ACCCCTGGAT GAATCCCCGC ACGGAGACCC GCCTTTGGCC GCACGAGCAG
GCGGAAATCT ATCTGGCCTA TGACCATATC TTCGATTGCA GAAGACATGG CTGGGCCAAC
CTGCAAAGCA TGCACCTGAA CATGCCTTTC GCAAACGACG ATGAATTTGC GCGGCTTCAC
GCCGCGGTGC GGCTGCTGCT GCCCATTCTT CCAGCAGTAG CGGCAAGCTC CCCGATCGCC
GAAGGCAGCT ATTCTGGTTG CCTGGATTTC CGGATGGCAA ATTACTGCGA GCACCAGCTG
AAGGTACCCT CGACCATCGG CCGGGTAATC CCGGAAACAG TCTCCAGTGC CGCCGCCTAT
GAAGAAAAGA TTCTGGCGCC GATGTACCGC GAAATTGCAG CACTCGATCC GGAAGAAATG
CTACAGCATG AGTGGCTCAA TGCGCGCGGG GAAATTCCCC GCTTTGACCG TAACGCTATC
GAGATCCGCG TAATCGATAC CCAGGAATGC TCGAGTGCTG ACTTTGCCAT CGCCGCTGCT
GCGATCAATA TCGTGCATGC GCTCTATAGC GAAGCTTACG CGTCGCTGGT CGCGCAGCAA
TCCATCGCCA CGGAAGCATT GGTCAGGATC ATGCATGCCT GCATCCGTGA CGCAGAGCAA
GCCGTAATCG ATAATGTGGC ATACCTTCGC CTGATGGGTT TTCCGGGTAC GGATTGCACG
GCTGGCGTTC TGTGGCGACA TCTTGTCGAG TCTGAGATGC CAGTCGAATC CAGGCAAGGA
AAAAGCTGCC GAGAAGCGCT GCAAACCATT TTAAGAGAGG GCTCGCTTGC GCGCCGCATC
CTGAATGCGA TCGGTTCCGA CTTCAGCAGG AGACGGTTGG AGACGGTCTA CCGCGAGCTG
TGCGATTGTC TGGACGAAAA TCGCATGTTT CTGAAATGA
 
Protein sequence
MTAALPAFTG YGIELEYMIV DRNTLAIVPI ADELLRKLTG AFASEAGNGQ LGWSNELVLH 
LLELKNIDPQ PDISSLSVAF QGEIRRISKV LESMNARLMP TGMHPWMNPR TETRLWPHEQ
AEIYLAYDHI FDCRRHGWAN LQSMHLNMPF ANDDEFARLH AAVRLLLPIL PAVAASSPIA
EGSYSGCLDF RMANYCEHQL KVPSTIGRVI PETVSSAAAY EEKILAPMYR EIAALDPEEM
LQHEWLNARG EIPRFDRNAI EIRVIDTQEC SSADFAIAAA AINIVHALYS EAYASLVAQQ
SIATEALVRI MHACIRDAEQ AVIDNVAYLR LMGFPGTDCT AGVLWRHLVE SEMPVESRQG
KSCREALQTI LREGSLARRI LNAIGSDFSR RRLETVYREL CDCLDENRMF LK