Gene Nmul_A0631 details


Gene Information

Locus tag: Nmul_A0631
Symbol:
ID: 3784427
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 717475
End bp: 719487
Gene Length: 2013 bp
Protein Length: 670 aa
Translation table: 11
GC content: 56%
IMG OID: 637810713
Product: polysaccharide biosynthesis protein CapD
Protein accession: YP_411330
Protein GI: 82701764
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1086] Predicted nucleoside-diphosphate sugar epimerases
TIGRFAM ID:
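
The coordinates and lengths in this record agree with each other, which can be checked with a little arithmetic. The Python sketch below assumes that the start and end positions are 1-based and inclusive (the record does not state this) and that the protein length excludes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of encoded amino acids, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(717475, 719487)
print(length_bp)                  # 2013
print(protein_length(length_bp))  # 670
```

Under those assumptions, 2013 bp corresponds to 671 codons, of which the last is the stop codon, leaving the 670 aa reported above.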


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCGCGC CCAAACTCAA CATAAGGACG GTTATTGCAT TCGGGCATGA TGTTGTAGCT 
GCCGCCATCG CCTGGTCCCT GGCTTATCTG TTCCGCTTCA ATTTCGAGAT ACCCTTCGTT
TATCTGGCAT CTTTGGAAGA GATTCTGCCG TGGGTGGTCC CGATACACGC AGCAGCCTTC
TTATTCTTTG GCCTGTATCG TGGACTCTGG CATTACGCAA GCCTGCCCGA TTTGCGGCGC
ATCCTGTTTG CTGTATCCGC TTCGGCTGCA GCCGTCCCAT TGGTGCTATA CATGTTGCAG
ATACTGGTCG GTGTCCCCAG AACTGTGCTG ATACTGGCCC CCATCCTGCT GCTGTTCATC
ATGGGAAGCA GCCGTCTCGC TTACCGGTTC TGGAAAGAAC ACCGGCTTTA CGGCCGCAGT
AAAACGGGAG GCAATCTCGT TCTGGTGATA GGAGCAAGTG ATGCTGCTGT CGGCTTGGTA
AAAGAGCTGG CGCGCAATGT GGAATGGCGC GTCGCGGGTT TCCTTGACGA CGATCCGGCC
AAGCGGGGGC TGATGCTGCA CGGATTCAAA GTACTGGGCC GCATCAACGA TCTGCCGGAG
GTAGCCCGGA AACTGGGTGT CGCGCATGCC ATTATCGCTC TCACCTCCTC CGCTTCAGAT
CGCCGCAAAT CCTATCGCAC CCACTCTGAT CGCCGCCGCC CCGACCGCCT CCTGCGCGAC
CGACGTCGAG CCTTGCAACT GTGTGCAGCG GCAGGGGTAA AGGCCTTGAT CGTCCCTTCC
TATAACGACC TGGTGAGCGG CAACATCAAG GTGTCACAAA TCCGGACAGT CGAGCCTGAA
GATTTGCTGG GGCGGGATCC TGTCGTGCTG GATAACGATG GACTGCACGA TTTACTGACG
GGAAAAACGG TATTGGTTAC AGGCGCGGGG GGTTCGATTG GATCGGAGTT ATGCCGACAG
ATCGTCAAAT TCGCACCGGC TCAACTGGTA CTGTTTGAAT TGAATGAATT TGCGCTGTAC
AGCATAGAGC AGGAATTCCA GGCCGATTTT CCAGAGATTC CCATGATATT CGCCATTGGC
GATGTCAAGG ATGAAGCGCG GCTATCACAG GTATTCCTGC AATACCGGCC GGCTATTGTG
TTTCATGCGG CTGCCTACAA GCATGTGCCC CTGATGGAAC AGGAAAATGC ATGGCAGGCG
GTGCTGAACA ACGTGCGGGG AACCTATGTC CTGGCGCAAA CCGCCATCCG GTACGGGGTG
GAGAAATTCG TCCTGATTTC GACCGACAAG GCCGTCAACC CGACCAATGT CATGGGCGCG
AGTAAACGCC TGGCTGAAAT GGTATGCCAG GCGTTACAGC AGTCGATTGC ATCTCCTGAC
AGTCCCACCG AAAACGCCTC CGGCAAAATT GTTCTGGGCA GCAGGCTTGA AAGGAGGCAG
GAGCCACGAT TCGTCATGGT GCGTTTCGGT AATGTGCTGG GCAGTGCGGG CAGCGTGATT
CCCAAATTCA GGGAGCAGAT TGAAAAAGGG GGACCGATAA CGGTAACGCA CTCGGAAATC
ACCCGTTATT TCATGTCCAT ACCTGAGGCA GCACAGTTGG TTCTACAGGC CGGCTTGATG
GGCGGCAAGC GGCGGGGGGG GGAAATTTTC GTGCTGGATA TGGGCGAACC GATTAAAATC
GCCGATCTTG CGCGAGATCT GATTCGTCTT TCCGGATTGA GCGAGGATGA AATAAAAATT
GTTTACAATG GCTTGCGCCC CGGCGAAAAA CTCTACGAAG AACTGTTGGC GGATGACGAA
AATACGCTTC CCACGCCCCA CCCCAAATTG CGTATCGCGC AAGCACGTCA GGTAGACAGA
AAATGGCTGA CAGCTCTGCT GGCGTGGCTT GAGGAGCATC CCGCCCTGAG CGACGAAGAG
GTCAAGCGGG AGCTGCCCAG ATGGGTTCCG GAATACGTGA GCGCGGAATC AGTTATCCCC
GTTTCCCAGG TTCGCGAGTC TCGTATCGCA TAG
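
The GC content field in the gene information can in principle be recomputed from the gene sequence. The following is a minimal Python sketch, assuming the value is simply the fraction of G and C bases over the whole sequence (the record does not say how its 56% figure was calculated or rounded). The helper name `gc_content` is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above, pasted with its 10-base spacing.
first_line = "ATGCCCGCGC CCAAACTCAA CATAAGGACG GTTATTGCAT TCGGGCATGA TGTTGTAGCT"
print(f"{gc_content(first_line):.0%}")
```

Passing the full sequence, with its spaces and line breaks, gives the figure to compare against the GC content field.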
 
Protein sequence
MPAPKLNIRT VIAFGHDVVA AAIAWSLAYL FRFNFEIPFV YLASLEEILP WVVPIHAAAF 
LFFGLYRGLW HYASLPDLRR ILFAVSASAA AVPLVLYMLQ ILVGVPRTVL ILAPILLLFI
MGSSRLAYRF WKEHRLYGRS KTGGNLVLVI GASDAAVGLV KELARNVEWR VAGFLDDDPA
KRGLMLHGFK VLGRINDLPE VARKLGVAHA IIALTSSASD RRKSYRTHSD RRRPDRLLRD
RRRALQLCAA AGVKALIVPS YNDLVSGNIK VSQIRTVEPE DLLGRDPVVL DNDGLHDLLT
GKTVLVTGAG GSIGSELCRQ IVKFAPAQLV LFELNEFALY SIEQEFQADF PEIPMIFAIG
DVKDEARLSQ VFLQYRPAIV FHAAAYKHVP LMEQENAWQA VLNNVRGTYV LAQTAIRYGV
EKFVLISTDK AVNPTNVMGA SKRLAEMVCQ ALQQSIASPD SPTENASGKI VLGSRLERRQ
EPRFVMVRFG NVLGSAGSVI PKFREQIEKG GPITVTHSEI TRYFMSIPEA AQLVLQAGLM
GGKRRGGEIF VLDMGEPIKI ADLARDLIRL SGLSEDEIKI VYNGLRPGEK LYEELLADDE
NTLPTPHPKL RIAQARQVDR KWLTALLAWL EEHPALSDEE VKRELPRWVP EYVSAESVIP
VSQVRESRIA
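
The protein sequence corresponds to the gene sequence translated with translation table 11. The Python sketch below builds the codon table by hand rather than relying on a library. It assumes that table 11 assigns the same amino acid to each codon as the standard genetic code and differs only in which codons may act as start codons; since this gene begins with ATG, that difference does not matter here. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence: str, to_stop: bool = True) -> str:
    """Translate a DNA string, ignoring whitespace and case.

    Stops at the first stop codon when to_stop is True; otherwise stop
    codons appear as '*'. Codons containing characters other than
    A, C, G, T raise KeyError.
    """
    dna = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*" and to_stop:
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCCCGCGC CCAAACTCAA CATAAGGACG"))  # MPAPKLNIRT
```

The ten residues produced match the start of the protein sequence listed above. Translating the whole gene sequence the same way allows a residue-by-residue comparison with the full protein.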