Gene Nmul_A0617 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0617 
Symbol 
ID3784413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp698972 
End bp700189 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content54% 
IMG OID637810699 
Productpoly-gamma-glutamate synthesis protein (capsule biosynthesis protein) 
Protein accessionYP_411316 
Protein GI82701750 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGTAG GTCCGCTCTA CGCCCAAGCC GTGCGGGACG GTGTCCAGAT TTCTGTAAAG 
AATGGGGATC AGACCGCTCC TGGTCCTCGA TCGCCGGGCG GACATACTTC GATCAAGTTA
TTTCTCTGTG GCGATGTCAT GACGGGCAGA GGCATAGACC AGGTCTTGCC TCACCCCGGC
AATCCCATTC TTTTTGAGGG GTACATGAAA AGCGCAACAG GCTATGTTGA GCTTGCCGAA
GAGGCAAACG GGCCGATTCC GCACCCAGTC CCTTTTTCCT ATATCTGGGG AGATGCGCTT
GCCGAACTGG AGCGTAGAAA GCCCGATGTT CGCGTCATTA ACCTCGAAAC CGCGCTTACC
CGTAGTGACG AAGTGCAGGA CAAGGCAGTG AACTACCGCA TGAATCCGGA TAATATCCCC
TGTATCACTG CCGCGAAGAT CGATTGCTGC GTGCTGGCGA ACAACCATGT TCTGGACTGG
GGGTATGAAG GTCTCGCCGA GACGTTGAAA ACGCTCAAAC GAGCCGACAT AAAAATAGCT
GGCGCCGGCT TAAACGTTCA AGAGGCGCGG CAACCCGCGG AGATGACAGT TCCCGGAAAG
GGACGTGTAC TGGTTTTCTC GCTCGGATCG GAAACAAGCG GCATACCCTG GAACTGGGCG
GCTCGAACAG ACCGGGCAGG CGTGAATCTG TTGCCGGATT TTTCAGCAAA AACGGTTCGG
GAAATTCGCG ACAGAATAAA GCAGGTCCGC CTGCCCGGTG ACATTGTGGT TGCTTCAATC
CATTGGGGTA ATAACTGGGG TTATGCAATT CCAGTCGAGC AACAGGATTT TGCACACGGC
CTGATCGATG AAGCGGGTGT TGACGTCATC CACGGCCATT CATCGCATCA TGTGAAGGGC
ATCGAGGTTT ACAGGGGAAA GCTCATTCTT TATGGATGCG GCGACTTCCT GAATGATTAT
GAAGGCATCT CAGGGCACGA GACTTATCGG GGTGACCTGA CGTTGATGTA TTTCGTGAGC
GCGGAGCCGC AGACCGGCAA ACTCGTCAGC CTGTCGATGG TGCTCATGCA GGTCAGGCAT
TTCAAATTGA ATCGGGCATC TGATGTCGAT GCTTCCTGGC TGAAGAATAT CCTGAACAGG
GAGGGGAAGA AGCTGGGGAC GTCGGTGGAA CTGACGGCGG ATAATACCTT GATGCTCCGA
TGGATGCTCC AAGGATAG
 
Protein sequence
MTVGPLYAQA VRDGVQISVK NGDQTAPGPR SPGGHTSIKL FLCGDVMTGR GIDQVLPHPG 
NPILFEGYMK SATGYVELAE EANGPIPHPV PFSYIWGDAL AELERRKPDV RVINLETALT
RSDEVQDKAV NYRMNPDNIP CITAAKIDCC VLANNHVLDW GYEGLAETLK TLKRADIKIA
GAGLNVQEAR QPAEMTVPGK GRVLVFSLGS ETSGIPWNWA ARTDRAGVNL LPDFSAKTVR
EIRDRIKQVR LPGDIVVASI HWGNNWGYAI PVEQQDFAHG LIDEAGVDVI HGHSSHHVKG
IEVYRGKLIL YGCGDFLNDY EGISGHETYR GDLTLMYFVS AEPQTGKLVS LSMVLMQVRH
FKLNRASDVD ASWLKNILNR EGKKLGTSVE LTADNTLMLR WMLQG